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All cosmological observations to date are consistent with adiabatic, Gaussian and nearly 
scale invariant initial conditions. These findings provide strong evidence for a particular 
symmetry breaking pattern in the very early universe (with a close to vanishing order 
parameter, e), widely accepted as conforming to the predictions of the simplest realiza¬ 
tions of the inflationary paradigm. However, given that our observations are only privy 
to perturbations, in inferring something about the background that gave rise to them, it 
should be clear that many different underlying constructions project onto the same set 
of cosmological observables. Features in the primordial correlation functions, if present, 
would offer a unique and discriminating window onto the parent theory in which the 
mechanism that generated the initial conditions is embedded. In certain contexts, sim¬ 
ple linear response theory allows us to infer new characteristic scales from the presence 
of features that can break the aforementioned degeneracies among different background 
models, and in some cases can even offer a limited spectroscopy of the heavier degrees of 
freedom that couple to the infiaton. In this review, we offer a pedagogical survey of the 
diverse, theoretically well grounded mechanisms which can imprint features into primor¬ 
dial correlation functions in addition to reviewing the techniques one can employ to probe 
observations. These observations include cosmic microwave background anisotropies and 
spectral distortions as well as the matter two and three point functions as inferred from 
large-scale structure and potentially, 21 cm surveys. 
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1. Introduction 

Recent cosmological observations suggest that our universe is rather accurately 
modelled by the simplest version of a A-CDM cosmology mH The remarkable 
fact that one can model the large-scale evolution of the universe from recombination 
(redshift z ~ 10^) all the way through to the present epoch with just six adjustable 
parameters speaks of an underlying simplicity and elegance that begs for a deeper 
explanation, possibly from some fundamental physics setup. 

It is commonly accepted at present that the inflation paradigm provides such 
an explanation [6H8]. However, of the parameters of the A-CDM model- to wit, the 
baryon density Df,, the density of cold dark matter 17^, the dark energy density 
the reionization optical depth t, the scalar spectral index Ug — 1 and the amplitude of 
the primordial power spectrum A^- only the scalar tilt is typically predicted by any 
given inflationary constructiorj^ Although one can invoke some selective criteria of 
‘inference to best theory’ [5] to take the current set of observations as confirmation of 
the inflationary paradigm, it needn’t be too much of a stretch to probe the available 
observations for further evidence before declaring a particular class of inflationary 
models as the true description of the very early universe. Furthermore, given the 
large degeneracies that exist between the predictions of different models [inHH] 
(even drastically different cosmologies m) at the level of the primordial power 
spectrum, one might ask, are there any other handles on the data that increase our 
ability to discriminate between these, or even test the very inflationary paradigm 
itself? 

In part due to such considerations, a fair amount of theoretical effort has been 
devoted recently to understanding the consequences of various inflationary models 
with respect to their predictions for primordial non-Gaussianity m . If observed, the 
interaction statistics of the inflaton field offers us a window onto its self-couplings, its 
couplings to other fields and appropriately limited information about its embedding 
in some parent theory. Although many types of primordial non-Gaussianity could 
plausibly yet turn up in observations of the cosmic microwave background (CMB), 
the preliminary conclusion from the recent Planck mission |15j is that all obser¬ 
vations are consistent with the presence of only those non-Gaussianities generated 
by foregrounds and non-linearities during and after recombination [T6H2(1] . It thus 
remains a pertinent question to ask whether or not we have exhausted all the infor¬ 
mation content that is potentially available in the primordial two-point correlation 
functions. Even if primordial non-Gaussianity is ever detected, the question takes 
on added significance in light of the vast amount of data due to become available 
in the near to long term through large-scale structure [2TII23j . 21 cm observations 
and observations of the spectral distortion of the GMB ^51 [2^ . In partic- 


'Although interesting deviations from its predictions might be present at longer wavelengths with 
marginal significance [ 3 . 

"The amplitude is typically tuned to fit with observations and the other parameters are put in by 
hand a posteriori. 
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ular, the potential gain in the number of modes available to us - roughly, having 
access to information in a three-dimensional volume as opposed to a surface (the 
last scattering surface) - and potentially over a greater range of comoving scales 
will enable us to ask new questions, and old questions much more meaningfully of 
the data [HO] , 

One particular class of questions that has occupied a great deal of attention in the 
literature over the years has been concerned with whether or not there are features, 
or localized deviations from scale invariance in the primordial power spectrum. From 
the perspective of linear response theory, one can rephrase certain aspects of this 
in terms of the more basic question: are there any new characteristic scales present 
in the primordial data? Clearly when we observe the cosmic microwave background 
we see evidence for the scale of gravity (Mpi) and the characteristic mass scale of 
the mechanism that generated primordial structure (for example ~ Vo/{eMpf) 
during single field slow roll)[^ up to the order parameter e which would be fixed 
by a positive detection of primordial tensor mode^)^ The observed deviation from 
scale invariance in the spectral index (SHU |35] indicates a dynamical origin for the 
adiabatic perturbations, though directly inferring a mass scale from these depends 
on a host of model dependent priors. So we ask: might there be other scales lurking 
in the data that, if probed, could offer us hints of the scales that parametrize the 
parent theory or model in which the mechanism that generated primordial structure 
is embedded? 

At this point the reader might well be wondering whether or not state of the 
art CMB observations haven’t already conhrmed a featureless, red tilted primordial 
power spectrum to a large degree of accuracy BHa thus rendering the questions 
at hand irrelevant? In actual fact, what has been confirmed is that in a priori 
parametrizing the initial conditions of the A-CDM universe with an almost scale 
free initial power spectrum (normalized at a hxed comoving scale) with the spectral 
tilt and a possible logarithmic running as parameters to scan over (among oth¬ 
ers), one arrives at a best fit value rig « 0.96 w with no significant evidence for 
running. This is qualitatively different from saying that such a spectrum has been 
observed. CMB observations are intrinsically limited not just by cosmic variance but 
also by the theoretical priors that one has to invoke in order to analyse the data at 
all, in addition to having to contend with the unavoidable compression inherent in 
the projection of spacetime dependent inhomogeneities onto a surface of last scat¬ 
tering This intrinsically limits our ability to directly reconstruct the primordial 


“'In putative models where the big bang is emergent or the result of a bounce from a prior contract¬ 
ing phase, the amplitude is predicted by the characteristic scale of the underlying dynamics. In 
the context of a particular string construction for example, ~ /Mpi |31II32| . An analogous 
conclusion also follows in the context of cyclic universe models m- 

^'^Although this too makes assumptions of the field content of the universe between laboratory and 
inflationary scales even if these fields are completely decoupled from the inflaton sector m- 
'"Roughly, since foregrounds begin to dominate the CMB sky at multipoles around £ ~ 2000, one 
has at most access to approximately 10® pixels of data in the CMB, evidently a compression of 
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correlators from any given set of CMB observables, perhaps evident in the observa¬ 
tion that features could already be present in currently available data, but not with 
any notable significance [3S]. Given that future large-scale structure (LSS) data 
promises to recover some subset of the (volume) information content of the primor¬ 
dial power spectrum projected out by the CMB onto the surface of last scattering, 
one might hope to look for the presence of features in the primordial correlators 
with much greater significance. The prospect of extracting 3-d information on the 
primordial power spectrum at hitherto unseen scales through observations of the 
21 cm background and spectral distortions of the CMB only makes this endeavour 
even more pertinent. 

In the context of inflationary cosmology, transient stronger couplings in the ef¬ 
fective theory of the adiabatic mode, the presence of initial particles or modified 
vacua, particle production, phase transitions and modulations of the effective po¬ 
tential during inflation all furnish examples where localized features and ringing 
will be sourced in the power spectrum and higher order correlators. From these 
features, one can in principle infer characteristic mass scales which offer a window 
(with appropriate degeneracies from the perspective of the effective theory) onto 
the couplings and field content of the parent theory in which inflation is embedded. 
Characteristic scales that parametrize relaxation to the attractor, or the decay of 
any initially present particles could offer us a glimpse onto pre-inflationary dynam¬ 
ics or excited initial states if inflation lasted not much longer than required by the 
solution to the horizon problem. Additionally, the presence of localized violations of 
slow roll and features in the potential will also source features in the power spectrum 
that can in principle, be distinguished from the previously mentioned possibilities. 

In what follows, we will review the theory, the motivation and the observational 
prospects for detecting features and new characteristic scales in the primordial cor¬ 
relation functions. As mentioned, the enterprise is especially relevant given the 
wealth of data from LSS and 21 cm observations that are due or envisaged in the 
near future. The latter promises to give us access to comoving scales far beyond 
those available to us in CMB and LSS at redshifts where they remain in the linear 
regime. In combination with measurements of spectral distortions of the CMB, we 
are thus being offered a much larger window onto inflation as it progressed. 

Although the following treatment won’t be exhaustive, we hope to cover the basic 
aspects of those mechanisms that are well motivated theoretically and whose predic¬ 
tions might conceivably be tested against current and future cosmological data. The 
first half of our treatment begins in the next section by formulating a perturbative 
understanding of the origin of features in the comoving curvature power spectrum 
and bispectrum (to be precisely dehned further), denoted Vu and respectively, 
that model many physical situations of interest. From these, we hope to gain an 
understanding of which signatures to expect from a given context and whether they 
can be distinguished from each other. We then review some of the more widely 


the total information content of the inhomogeneities at last scattering by a factor of about 10®. 
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studied mechanisms that could generate features in primordial correlators and how 
these relate to their underlying fundamental physics realizations. The second half 
of the treatment commences with a review of how features in the primordial corre¬ 
lators of TZ imprint on the temperature and polarization anisotropies of the CMB 
as well as in distortions of its spectrum, after which we turn our attention towards 
observational aspects of searching for such features. Certain details of our treatment 
that require more involved elaborations are covered in the appendices. 


2. Preliminaries 

To a large degree of accuracy, the cosmological perturbations observed in the CMB 
are adiabatic in nature. One can interpret this as an indication for them having been 
generated by (i) a single degree of freedom, or (ii) through some mechanism that 
converts entropy perturbations associated with multiple degrees of freedom into 
adiabatic perturbations shortly before or (iii) after horizon exit |36) . For simplicity 
and predictability, we will focus on the former scenario in what follows, however, it 
should be clear that the basic examples that we’ll cover in the following generalize 
straightforwardly. 


2.1. The generation of curvature perturbations 


In what follows, we will set up a simple framework for understanding how features 
can be generated and imprinted in the primordial power spectrum in any given 
context. We begin with the most general quadratic action for cosmological pertur¬ 
bations in terms of the so-called Mukhanov-Sasaki (MS) variable [571155] : 


S2 = l f d^s 


-c2(Vu)2 -b —C 


where the above is defined on conformal coordinates and where 


z := a 


nc. 


( 1 ) 


( 2 ) 


In the above, primes represent derivatives with respect to conformal time r, (po the 
background inflaton field, "H := a'/a and Cg the adiabatic sound speed. There are 
multiple advantages with working with the MS variable. Chief among them is the 
fact the action for the MS variable (in conformal coordinates) is that of a canonically 
normalized scalar field on Minkowski space, and hence quantization and specifying 
the initial vacuum state is straightforward. In addition, all information about the 
background solution which we perturb around to leading order is contained in Cg 
and z. The relationship with the comoving curvature perturbatiorp^ is given by 

V = zTZ, (3) 


this review, we adopt the notation TZ for the comoving curvature perturbation, that is the 
curvature purturbation on a foliation where the perturbed momentum flux STq vanishes. In single 
field inflation, this is equivalent to the foliation where the inflaton perturbation Sep vanishes — a 
gauge choice also known as ‘unitary’ gauge, whose significance we shall discuss shortly. Recently, 











June 22, 2015 


8 

in terms of which the perturbed quadratic action (reverting to cosmological time t) 
reads 


52 


/ 


d^x o^eMp^i 




(4) 


where dots refer to cosmological time derivatives and e := —H/H^ = (()§/(2MpjiJ^). 
The advantage of working with the comoving curvature perturbation is that in the 
absence of isocurvature perturbations, it is conserved at super-horizon scales to all 
orders in perturbation theory |40j (to arbitrary order in loops) |411143] , which allows 
us to relate its value at horizon re-entry directly to its value at horizon exit during 
inflation. 

Since temperature and polarization anisotropies directly relate to TZ, all late time 
observables will be expressed in terms of correlators of these. However for purposes 
of disentangling the various sources of features that could imprint on observables 
during inflation, it is useful for the moment to continue working with the MS vari¬ 
able. It should be emphasized that 0 and 0 only assume the monotonicity of (pQ 
(i.e. that it is a good physical clock) and that the background deviates from exact de 
Sitter space''” The primary object of interest to compute in determining the CMB 


temperature and polarization correlation functions is the so-called dimensionless 
power spectrum, which for the comoving curvature preturbation is defined as 


27r^S^{k + q)Vn,^Ak) := |in i„, (5) 


with k := |fc| and where the subscripts indicate that we are interested in computing 
finite time correlation functions, requiring us to work in the Schwinger-Bakshi- 
Mahanthappa-Keldysh or ‘in-in’ formalism for shorl |''“‘[ which we assume to always 
be the case, henceforth dropping the subscripts indicating so. We review the basics 
of the in-in formalism tailored to our purposes in jAppendix A| 

For the purposes of the discussion to follow, we allow for the perturbations to 
begin at some fixed (or asymptotic) initial time in an arbitrary initial state IV’i)) 
which in most cases we will take as the Bunch-Davies state. As we shall see in 
Sect.|^ the TT angular power spectrum can be expressed in terms of its multipole 
coefficients from ^ for the low multipoles {£ < 30) [iSJ 07] as 

9 r dk 

C. = - y -^Vn,^Ak)mro - r.)], (6) 


the notation traditionally reserved for the curvature perturbation on constant energy density 
hypersurfaces has become popular notation for the comoving curvature perturbation. The two 
variables are equivalent in the context of single field inflation but will differ in general. However 
this switch in notation - popularized by m - differs from historical use and the convention in 
which observations are reported pm, to which we defer. 

'"'^Otherwise the scalar perturbations would be pure gauge. 

"^^^^For a review of this formalism in the context of computing cosmological correlators, see |44| . For 
certain caveats regarding its implementation in computing higher order cosmological correlators, 
see m- 
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where from ^ 'Pn,ipi = 'Pv,ijiilz^, where is the spherical Bessel function of order 
£ and where tq — represents the comoving distance to the last scattering surface. 

Before we proceed to discuss the generation of features in power spectrum, it 
is useful to remind ourselves how a scale invariant spectrum is generated in the 
standard slow roll inflationary scenario, the very definition of which requires that 


= = 


2M2jiJ2 


(7) 


be small (the latter equivalence follows from the Friedmann equations). Assuming 
e to be constant (certainly accurate to first order in slow roll parameters), one can 
integrate the above in cosmological time to yield H = i with c some integration 
constant that we define as c := . Integrating once more and expressing the new 

integration constant that appears in terms of the scale factor oq at t = 0 we find 

a{t) = ao(l + (8) 


Clearly when e 


0 the above is the very definition of the exponential: 
lim a{t) = ao 


£->■0 


(9) 


We stress however that (|^ is exact for any finite value of e that is constant. Con¬ 
verting to conformal time and dropping the subscript on J/, the scale factor Q 
becomes 

^ ( 10 ) 


[-TiJ(l-e)]‘'-i/2’ 


where we have defined 


3-e 


2(1-e)- 

We now presume a phase of slow roll with Cs = 1 and with e constant or small 
enough so that we can neglect higher order corrections. In this limit, the equations 
of motion that result from Q per Fourier mode are given by 


v'l + 


- 




Vk = Q 


( 12 ) 


which follows from re-expressing ([2j) as z = a-\/2eAlpi and using (10). The solution 
for the mode function requires the additional input of boundary conditions, which 
we define in the asymptotic past. Specifically, at early enough times (t —>■ —oo), a 
particular comoving scale will be deep within the horizon and for which spacetime 
will look like it effectively has the properties of Minkowski space. Requiring that all 
such modes begin in their corresponding Minkowski space vacuum means that the 
solutions must asymptote to the plane wave solution Vk ~ which fixes 

the normalization and phase of the two independent solutions of to be such 

that 




( 13 ) 
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corresponds to the mode functions of the so called Bunch-Davies vacuum state 
[ISlIin]- From the above and ([^, one finds that the mode functions for the comoving 
curvature perturbation are given by 

= (14) 


which straightforwardly reduces to the familiar expression in the limit v —> 3/2: 


TZkir) = i 


H 


1 


;(1 + ikT)e 


— ikr 


(15) 


Mpi \J 4efc3 

Given that we expand the comoving curvature perturbation around the Bunch- 


Davies vacuum I'i/'i) = |0 )b£) in terms of the Fourier modes (141 as 


% = 7^fc(T)% -f 7^/.(T)a^-, 


(16) 


where the interaction picture creation and annihilation operators satisfy the usual 
commutation relations [%,at] = {2Tr)^6^^\k—q), one can straightforwardly evaluate 
the power spectrum ([^ at late times (fcr —>■ 0“) a{3 


~ [h 

where the so called spectral tilt is given by 


ns —1 


(17) 


Us — 1 = 2/7 — 4e 


(18) 


with r] := being the familiar ‘eta’ parametei]^ In fact, since the time vari¬ 

ation of the slow roll parameters are themselves second order in slow roll quantities 
(more generally, e and rj are a subset of an infinite hierarchy of ‘slow roll’ parameters 


[SSlin]) one can understand the right hand side of ( [l^ as depending adiabatically 
on k, implying a possible running of the tilt, a situation for which there exists no 
decisive evidence to date [3S] . Thus for small enough values of the slow roll param¬ 
eters, the power spectrum is almost scale invariant. We are now equipped to turn 
our attention towards the main topic of this section- how the various mechanisms 
that could superpose features on this spectrum do so. 


‘^Where we make use of the asymptotic expansion of the Hankel function lim hI^\z) ~ 
-il^T(y){2lzr. 

^The 77 contribution to the tilt arises from the fact that once one allows for time dependence 
of the slow roll parameters, the tilt gets extra contributions that can be obtained by to first 
approximation through the mnemonic of simply evaluating the power spectrum at the time the 
comoving wave number k crosses the horizon, inducing an implicit k dependence in the e factor in 
the denominator of flTl which can be determined from the identity e = 2He{e — g). 
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2.2. Features and characteristic scales in T’'n.{k) 
We first recall the action for the MS variable Q 

S2 = IJ - c2(Vu)2 + 


where we take note of the fact that the above describes the action for an adiabatic 
perturbation around any cosmological background (not just those that are slow roll 
inflating) provided that the field cj)o furnishes a good physical clock (i.e. so that 
z never vanishes). Furthermore, we take note of the fact that to leading 

order in the effective theory (see the following subsection) all information of the 
background solution is contained in the functions Cg and z, and all information of 
the boundary conditions are contained in the initial state or more generally in 
the initial density matrix pi, usually taken to represent the Bunch-Davies vacuum. 

We now imagine a fidncial slow roll attractor solution with Cs = cq, z = zo(t). 
Retracing our steps in the last section results in the power spectrum 


rn(k) 


with the spectral index 


1 /cokY~^''° 

8 ^ 2 ^^ [Wj 


(19) 


no — 1 = 3 — 2i/o 


( 20 ) 


now corresponding to that generated by the background trajectory defined by zq- We 
assert that a wide range of situations where localised features and oscillations can be 
superposed onto such an attractor spectrum can be understood in a ‘perturbative’ 
contexlp^ That is to say, all observable correlators can be computed as perturbative 
corrections to the correlators generated on the background solution defined by cq 
and Zq. The underlying reasoning is simple - we generically expect any features that 
might be present to be smaller in magnitude than the amplitude of the spectrum 
locally. To flesh this out, in what follows, we take cq = 1 withont loss of generality, 
and imagine a localized drop in the speed of sound or localized deviations from 
the fiducial attractor solution. As we shall show in the next section, this is the 
resultant in many situations of interest (including those incorporating loop effects) 
that generate localized features. 

Consider the time dependence of the background quantities Cs(t) and z(t) as 
perturbative deviations from the fiducial quantities c = Cq and Zo(t) by defining 


w(t) := cl - clir), 

(21) 

If _ // 

W{t) 

z zo 

(22) 


^’With the exception of situations where slow roll is interrupted at intermediate epochs, see Sect, 
f^for a discussion of this possibility. 
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Doing so allows us to write the quadratic action Q in terms of a ‘free’ part and an 
‘interacting’ part as 

52 = ^ / d^x - cliVvf + + 52.i„t (23) 

with 

52,int := ^ j [w(r)(Vu)^ + JD(r)i;^] . (24) 

The presumption is that both w{t) and W(t) differ significantly from zero only 
over a finite interval, and remain perturbatively small uniformly in time as we’ll 
explicitly show to be the case over several examples in the following section. Even 
though w and W both implicitly depend on the adiabatic sound speed, for the 
purposes of computing corrections to the attractor power spectrum, we can treat 
them as independent perturbations. 

Given that the fiducial attractor solution admits the mode expansion 


^0 ^ (25) 

with the index vq now characterizing the attractor defined by zq, we can expand 
the ‘free’ part of the MS operator v in the interaction picture in terms of its mode 
decomposition as 




(26) 


where the interaction picture creation and annihilatio n operators sa tisfy the usual 
relations [%, = {27r)^— q). From the results of Appendix A we can readily 

infer that the two point correlation function of the MS variable is corrected by the 
perturbation W(t) as 

Sw{vki{T)vk 2 {T)) ={2TTfS^{ki + k 2 ) f dr' 2 W{t')‘^ 

J To 

(27) 

and by the perturbation w{t) as 

5w{vk^{T)vk^{T)) = {2TTf kl5^{ki + k 2 ) f dr 2 w{t)'^{GI^{t,t)GI^{t,t)} 

Jtq 

(28) 

where 3{...} denotes the imaginary part of, and where the ‘free field’ Green’s func¬ 
tions are given by 


GI{t,t') = jV^Hil'>{-cokT)Hlf{-cokT'). 


, . ( 29 ) 

For the special case of a fiducial de Sitter background, this is given by 


2cofc(^ cnfcr)(^“^ cnfcr') 


,-ZCofe(T-T') 


Cofcr / 


Cofcr'/ 


(30) 
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Converting to the power spectrum for the comoving curvature perturbation at late 
times’^” at which point by assumption the background trajectory will have reverted 


to the fiducial attractor solution parametrized by zq and cq, results in the following 
correction to the power spectrum 

rO 


AVn{k) = -Vn{kH^^l 


{{-T)Hll'>\-cokT)^ (31) 


with the attractor power spectrum V-jz^k) given by the expression (19), and where 
the features are induced either by the perturbations w oi W (given by (211 and (22) 
respectively), or both, as will generically be the case when the speed of sound tran¬ 
siently changes during inflation. Hence, corrections to the attractor power spectrum 
manifest as the integral transform of the interaction potentials W(t) and w{t) with 
the kernel 


K,, (r, fc) := 5 {(-r)iJ(f (-cofcr)} . (32) 

Since the fiducial background solution is presumed to be that of a slow rolling 
attractor, one can readily expand the kernel in powers of slow roll parameters such 
that to leading order one has: 


K3I2{t, k) 


^ ^ c> J 2icokT 
TTCok'^y 



(33) 


where K ^/2 is the kernel for a fiducial de Sitter attractor and subleading corrections 
can readily be computed by the expansion 


1 

= H^%{z) + l^^^(z) 


(34) 


. 2 dv 

where the first Taylor coefficient z/q — 3/2 can be expressed in terms of the attractor 


tilt by (20). For completeness, we note that 




= *7rT3/2(z) + [Ci{2z) - iSi{2z)\H^^j‘^{z) 


-H[%iz), (35) 


though it is evident that to leading order (331 suffices. Thus we arrive at the key 
expression 


= f\r 

T’n cok 


W{t) ' 
k'^w{T) 


2icokT 


Cokr 


(36) 


Some remarks are in order at this point. It should be clear that the features in¬ 
duced in the power spectrum only have finite support in k space if the interaction 


must take care in evaluating the power spectrum at late enough times (r ^ 0~) such that 
the residual evolution of quantities that are ordinarily conserved on super-horizon scales subsides 
sufficiently. Failing to do so (for example, by evaluating quantities at horizon crossing) can result 
in errors as large as the quantities we are attempting to compute, such as the spectral tilt |52II53| . 
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potentials W{t) and w{t) do not contain arbitrarily fast variations. That is, un¬ 


less the potentials have a non-zero projection onto the kernel (33) for arbitrarily 


high momenta, the features induced in the power spectrum will cut off. One can 
expect reasonable bounds on how fast the interaction potentials can vary from the 
requirement that our background solution derive consistently from an underlying 
low energy effective description. Specifically, sudden changes in the inflationary po¬ 
tential or in the speed of sound are limited by the requirements that the inflaton 
embeds itself consistently in some UV completion, which implies the validity of a 
consistent derivative expansion for the inflaton field and its fluctuations, precluding 
arbitrarily fast variations in W(t) and w(rFor example, as we shall review in 


the next section, adiabaticity requirements bound variations in the speed of sound 

as El EH 


|c,|«M|1-c2 


(37) 


where M parametrizes the scale of heavy physics that we’ve integrated out in ob¬ 
taining our low energy effective theory (EFT). 

We also wish to remark that changes in the speed of sound can reasonably 
imprint at much smaller scales in the CMB relative to localized features in the 
potential (or localized deviations off the attractor). This is evident from the 


factor that dresses the kernel (33) for the interaction potential w{t) relative to the 


interaction potential W(t) (recalling (21) and (22)). This will prove important not 


only due to the fact that a locally varying sound speed is strongly motivated from 
underlying EFT considerations, but also as localized features are much easier to 
constrain at shorter comoving scales in the CMB, as we shall see shortlj’^*'' The 


tantalizing prospects of being able to discern the matter power spectrum at scales 
far beyond those accessible in the CMB through future 21 cm observations is not 
to be taken lightly in this regard. 


For illustrative purposes, we consider the effects of localized variations in the po¬ 
tentials W{t) and w{t) that qualitatively model a background that relaxes to the 
attractor (Fig. and a transient drop in (Figs. and respectively. We take 
note of the salient differences between these examples viz. the evident suppression of 
power at large comoving scales and the ease with which varying speeds of sound can 
imprint features at shorter comoving scales without violating adiabaticity. We stress 
that all physical examples that locally imprint features (with all modes initially in 
their adiabatic vacuum state) are captured to leading order in the variations of 
the potentials w{t) and W{t). These examples include certain models that locally 


Conversely, any ‘sharp’ features in the inflaton potential or sudden drops in Cs (defined as those 
for whose derivatives become comparable to the cut-off of the theory) are unlikely to be radiatively 
stable, as quantum corrections tend to smear these out in field space or in time, respectively. 
xivThis is due to the same volume vs surface area gain in the number of modes available as noted 
in footnote [v| 
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k/Mpc ' k/Mpc ' 




k/Mpc ' k/Mpc ' 


Fig. 1. Relaxation to the attractor with W(t) = with A = 5 X 10 “®/(47r^), tq = 

— 10'^ and with fi running from 2,1, 0.5 and 0.35 in the upper left, upper right, lower left and lower 
right panels, respectively. The fiducial attractor has spectral index ris = 0.965. 


interrupt slow roll, models with localized features in the potential, situations where 
localized particle production occurs either at tree or loop level, among others. We 
will be surveying the possibilities in the following section. 

Situations in which the perturbation modes begin in states that differ from 
the adiabatic vacuum (in the case of de Sitter space, this is the Bunch-Davies 
vacuum) can further modulate features on the power spectrum. Reconsider ([^ with 
an arbitrary initial state I'lpi) 

2TT'^5^{k + q)Vn,i,Ak) ■■= in- (38) 

Were j?///) to correspond to an eigenstate of the number operator of the curvature 
quanta (or were it the result of tracing over a thermal density matrix), the result 
would be a straightforward modulation of the power spectrum 


VnMk)=VnAk)[^ + Mk)] 


(39) 


where n(fc) is the occupation number of the mode. In general, any state for 
which (a^) = (at) = 0 while (aj;at) is J-function correlatec results in this phase 


independent modulation. More generally, we might be interested in situations where 
the initial state does not satisfy these properties. Consider for example that at tj, 


Such states are particular examples of a more general class of states- the so-called “random 
phase” states— for which the simple expression |39l results. [SiiT]. 
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k/Mpc 



k/Mpc 



lO-* 0.001 0.01 0.1 1 10 

k/Mpc“' 


lO -* 0.001 0.01 0.1 1 10 

k/Mpc“' 


Fig. 2. Transient drop in Cs, modelled by w { t ) = with A = 2 X 10“^/(47r^), tq = 

— 30 and with fi running from 0.01, 0.1,1 and 5 in the upper left, upper right, lower left and lower 
right panels, respectively. The fiducial attractor has spectral index Us = 0.965. 


the initial state corresponded to some state for which mode by mode, the mode 
functions were rotated as 


= cosh TZ^ + e sinh 9^ , 

= cosh6lg7^t + sinh6lg% , 


where the transformation is defined on the hypersurface at the time tj. In this 
case, one can write the initial state as \ijji) = t/(0)|O), where U{Q) is a unitary 
operator th at w e will explicitly construct shortly. The correlator we’re interested in 
computing (38) can therefore be rewritten as {0\U^TZj:{t)U U^TZ^{t)U |0), so that it is 


equivalent to calculating it in the original adiabatic vacuum after having performed 
a unitary transformation on the TZ operators. Evidently, the interaction picture field 
that produces quanta with the wavefunctions i?g is given by 


n'{t,x) = TZ{t,x) U = 


r|3jL p 

r|3jL p ^ ^ 


(41) 




where the two sets of creation and annihilation operators relate as 
bj: = cosh sinh , 

bt = sinhdgd_j: + coshS^rdt . 


(42) 
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k/Mpc 


k/Mpc 



k/Mpc 



k/Mpc 


Fig. 3. Transient drop in Cs, modelling a so called ‘ultra-slow turn’, represented by w{t) = 
AT^(Tanh[(r — Ti)/r] — Tanh[(r — with A = 5 X 10 “®/(47 r^), = —75, Tf = —60 and with 

fi running from 0.001,0.005,0.05 and 0.1 in the upper left, upper right, lower left and lower right 
panels, respectively. The fiducial attractor has spectral index rig = 0.965. 


If the transformation (401 only rotates modes up to some fixed comoving momentum 


scale, then the transformation is unitarj’^''' and can explicitly constrncted: 


1 


U{Q) = Exp J d^k ; 0£ := , (43) 

and where one can always choose 6k > 0. One can readily check that this transfor¬ 


mation effects (42): 


= h- 


(44) 


Relating the transformation (40) in terms of the more familiar Bogoliubov coefh- 
cients 


= cosh^j: 


(45) 


/3- = e sinh 0^ , 
one can evaluate the in-in correlator of interest as 

(V>/|%(t)i^?(t)|iA/) = \n'f6Hk + cf) 

= {1 + 2|/3g|2 + 2 cos (<5^^ + 2Aj) \af:pj:\} S^k + q), (46) 


xviThe Hilbert spaces generated mode by mode by the Heisenberg algebras related by the trans¬ 
formation 1^1 are only unitarily equivalent if 9^ has finite support. That is, we require the trans¬ 
formation (42 I to tend to the identity sufficiently fast as |fc| ^ oo. If had support for arbitrarily 
large values of k (e.g. when 9}^ is a constant as is the case for the so called a—vacua |58l460p . the 
two Hilbert spaces would be unitarily inequivalent m- 
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where we’ve defined the phase |7?.^|. In position space, the power spectrum 

is obtained from 


(V'/|7^^(^,f) IV’/) = 


(27r)= 


l%ni + 2|4-P + 2cos(% + 2A,-) |a,-/3,-|} , (47) 


so that finally, we obtain the generalization of (39) to states with potentially corre¬ 
lated phases 

VnA^) = 'PnAk) {1 + 2|4H" + 2cos (<5^- + 2Ag) ■ (48) 

Therefore, in addition to the contributions that result from localized variations of 


contributions from initial state modifications: 


the potentials (|2T|) and (|22|) that result in (pM|), we have the following additional 

(49) 


AVn 


'P'R 


(fc) — 2|/3j:| -|-2 cos (Jg-|-2Ajr)|Q!j:/3j,-| 


where demanding time reversal invariance of the modified vacua imposes the con¬ 
dition = 0 |60j . Similar to the adiabaticity conditions that restrict how much 


support (36) has in fc-space, we realize that the above modulation of the power spec¬ 


trum must also decay sufficiently fast. This is because from the perspective of (1141, 


support for AVr/Vr for arbitrarily large k would imply an infinite energy density 
of particle excitations over the adiabatic vacuum. 

Having deduced the most general forms in which features can imprint on the 


primordial power spectrum in the context of single field models (36)(491, it is useful 


(especially if we are also interested in how features may imprint in higher order 
correlation functions) to ground the rest of our treatment in the framework of the 
effective theory of the adiabatic mode, to which we presently turn our attention. 


2.3. The effective theory of the adiabatic mode 

By treating all the degrees of freedom available at any given energy scale as effective 
degrees of freedom whose dynamics are governed by an effective action^ we can 
parametrize our ignorance of the fundamental theory from which these degrees of 
freedom descend in a powerful and predictive manner [62H64] . In cosmology as 
in particle physics, an effective field theory (EFT) treatment is of particular use 
in searches for ’’beyond the standard model” physics. In the context of inflationary 
cosmology, an effective treatment proves indispensable given that we have little idea 
as of yet as to what the inflaton truly represents as a physical degree of freedom. 

There are two broad classes of questions one might be interested in- those per¬ 
taining to the background and those pertaining to perturbations around this back¬ 
ground (which may not even admit a perturbative description), each with its own 
relevant operator (i.e. derivative) expansion. Questions in the former class might 
concern the naturalness of obtaining enough inflation in a given context, that is to 
say concerning loop corrections to the inflationary effective potential and kinetic 
terms, whether reheating occurs efficiently enough, questions concerning the initial 
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onset of inflation and so on. For such questions, a covariant derivative expansion- 
one that is not premised on a particular background solution (though it is premised 
on expanding in a particular basis of operators)- is of utility [S51IH]. If we presume 
that such questions have been addressed, thus taking the background solution for 
granted, questions concerning perturbations around this background and in partic¬ 
ular of the adiabatic perturbations, are usefully addressed in the EFT expansion 
of the adiabatic mode EH EH- The virtue of expanding in this particular operator 
basis is that it exploits the properties of the adiabatic mode as the Goldstone boson 
associated with spontaneously breaking time translation invariance through cosmo¬ 
logical evolution [69], providing a useful book keeping device on the expansion. 

The analysis commences with the observation that in any given spacetime where 
cosmological evolution is driven by a single effective degree of freedom, there exists a 
preferred foliation defined by surfaces of constant field value, i.e. that this degree of 
freedom serves as a good clock. Denoting this held (j), this is simply the observation 
that in any arbitrary foliation where the held and its huctuations around some 
homogeneous background solution decompose as 

(j){x,t) = (j)o{t) + 6(j){t,x), (50) 


one can make an inhnitesimal coordinate reparametrization that gauges away 
the huct uatio ns completely. Namely, the transformation t ^ t + tt such that 
TT = gauges away the (f huctuations as [37] 


(fit, x) = (fo + Scfit, x) ^ (fo + d(j){t, x) - = (/>o(t). 


(51) 


In this foliation, surfaces of constant (f coincide with surfaces of constant time, 
and the only huctuating degrees of freedom are evidently in the metric. Thus it 
appears as if what was initially a huctuating scalar degree of freedom has been 
“eaten” by the graviton, which acquires a propagating longitudinal polarization. 
This is made manifest in the ADM parametrization of the metric [70] 

ds2 = -N^df + hijidx^ + N^dt)idx^ + N^dt) (52) 

where N, are respectively the lapse function and shift vector that dehne the 
foliation and where hij is the induced three metric on the spatial hypersurfaces. 
The utility of this decomposition lies in the fact that no time derivatives of the N 
or N® appear in the action, and hence correspond to non-propagating helds that 
enforce constraints, i.e. as Lagrange multipliers. Therefore all the dynamics of the 
gravitational held is contained in the 3-metric hij, which we parametrize as 

h,j=a\t)e^'^5,j, (53) 

where we have neglected tensor perturbations, and where TZ is the non-linear gener¬ 
alization of the variable introduced by Bardeen, Steinhardt and Turner ini in linear 


^'"’^Where the dot denotes the derivative w.r.t. the time variable that defines the particular folia¬ 
tion. 
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perturbation theory, which can be shown (in the absence of non-adiabatic pertur¬ 
bations) to be conserved on super-horizon scales to all orders in perturbations [JD] 
and to arbitrary loop order I1IH13]- The variable TZ is the conformal mode of the 3- 
metric of the spatial hypersurfaces and corresponds to a local rescaling of the scale 
factor. In pure gravity, this mode is constrained not to propagate however 
this is not so once coupled to a scalar degree of freedom. In comoving gauge (the 
slicing where (j){t,x) = (j}o(t)), the conformal mode has evidently eaten the scalar 
fluctuation mode and induces a longitudinal polarization for the gravitoij^^ that 
we identify with TZ- the local rescaling of the scale factor that is equivalent to a 
local time reparametrization of the background evolution. 

Effective field theory consists of writing down all operators consistent with the 
symmetries operative at the energy and curvature scales of interest, organized as an 
expansion in operator dimensions, suppressed by powers of the scal^^)^that defines 
when new physical degrees of freedom become relevant. To any given order, the 
coefficients of these operators are to be fixed by a finite number of observations. In 


the present context, we note that the foliation defined by (51) implies that all scalar 


metric and matter fluctuations are encoded in the dynamics of the 3-metric (53) 


through the adiabatic mode TZ. Thus it must be that any action for the perturba¬ 
tions be organized as an operator expansion in powers of TZ and its derivatives that 
preserves this privileged foliation. Clearly, any combination of operators that de¬ 
scribe perturbations around the background solution that is invariant under spatial 
diffeomorphisms of the hypersurfaces are permitted to appear in the effective action. 
Cheung et al [5B] have shown that in the absence of strong gravitational effects, a 
convenient operator basis is provided by powers of -1-1) := Sg^^ = 1 — 1/N^ 
and 6Eij := Eij — E^^, where the superscripts denote unperturbed quantities and 
where 


E^j = NKij := 


{hij 'Z/iNj 'ZI jNi 


(54) 


with K,, 


being the extrinsic curvature of the foliation and where V denotes co- 

to the action that describes the 


variant derivatives w.r.t. the 3-metric hij. That is 
background solution, we are free to add arbitrary combinations of 5g^^ and 5Eij 
that are scalars under spatial diffeomorphisms, which we subsequently organize in 
powers of TZ and its derivatives. The latter results after we solve for the N and N'^ 
constraints in terms of TZ, substitute the result back into the action, and expand in 
orders of the perturbation. The action for the background is given by [68] 


Sbg = / d'^iy^ 


M. 




(55) 


xviiiThis is analogous to the process that occurs in gauge theories when the gauge symmetry is 
broken. For this reason, in this context, comoving gauge is often also referred to as unitary gauge 

f68t . 

XIXWhich counts powers of canonically normalized fields as well as derivatives. 
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This can either be viewed as the result of having re-expressed the matter content 
that induced the cosmological evolution in terms of geometrical quantities through 
the background equations of motion or simply as the unique form 

that enforces tadpole cancellation once we perturb around the background solution. 
The operators that we can add to this background action encode the specific details 
of the matter sector that induced the background cosmology. Given that and 
5Eij are zeroth and first order in derivatives respectively though both at least first 
order in perturbations, up to quadratic order, we can add any combination of the 
following operators 


-( 2 ) 


{5E])\ SE^^dE,, 


(56) 


each of which can appear with arbitrary time dependent coefhcients. Therefore, it 
might naively appear that the resulting quadratic action for TZ might depend on 
four arbitrary functions’^’' However, up to quadratic order, the last two terms in 


(56) result in contributions that differ only by an integration by parts. 

Hence only three independent functions will serve to specify the quadratic action 
once the constraints have been solved for to express everything in terms of TZ. One 
of these functions, A(t), will be the functional coefficient of the quartic operator 
generated by the {SEl)^ term [74], whereas the other two are evidently 
given by Cs{t) and e(t) (|^: 


^2 = 


d'^x a^eM? 



{dTZ)^ ^ {d'^TZf 
- s - 1" A -— 


In the discussion to follow, we only consider the first two terms in the above to 
leading order. 

Similar reasoning at the level of the cubic action, one can add any of the following 
operators 

^( 3 ) ~ iSg^°)^SEl, 5g^\5El)\ 6g°°dE^^E,,, SE^SE^^SE,,, {SElf 

(57) 

which naively implies that the cubic action can depend on six arbitrary functions. 
However just as before, up to cubic order in perturbations, the last two terms in 
the above give contributions that differ only by an integration by parts. Hence the 
cubic action depends on an additional five functions on top of cubic terms generated 


by the four independent operators listed in (56) (see [75U77] for a detailed study of 
the effects of these operators on the CMB three and four point functions). 

A great simplification results if the matter sector that induced cosmological 
evolution does not couple to the shift vector in any way (or only beyond some 


’‘’‘Although second order in perturbations, the operators generate terms up to quartic order 
in derivatives once the constraints have been solved for. One can in fact act on the operators 
j56| l with arbitrary contractions of spatial covariant derivatives to generate more higher derivative 
terms at any given order in perturbations ED, although these will be relatively suppressed by 
powers of the high energy scale that limits the validity of the theory. We do not consider this 
possibility in the counting that follows, although one can readily generalize the argument to do so. 
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order in a derivative expansion), such as would be the case if the Lagrangian of the 
matter sector had the form 




(58) 


where (j) is some scalar degree of freedom. That is, the Lagrangian depends on a 
single scalar degree of freedom and its first derivatives alone or up to some order in 
the derivative expansion. Lagrangians that fall in to this class include the so called 
A:—inflationary, or derivatively coupled scalar models naEsi and the leading terms 
of any effectively single field model up to quartic order |80]. In this case, all but the 
first operator of (57) contribute vanishingljj^^to the cubic action, which hence only 
introduces one more function- the coefficient of Denoting the coefficient of 

the ((5g°°)^ and operators and respectively, we see that for a matter 


sector of the form (58) the action up to cubic order could only depend on only 


three functions and their derivatives- e,M| and M|. For this restricted subclass, 
beginning with the action 


S = 






M, 


pr 


- - H 




M* 


..2 I H 

P> 

(59) 

we solve for the lapse function and the shift vectors N and respectively and 
substitute them back into the action to yield the quadratic and cubic actions: 


So = M, 


pi 




{dnf 

r,2 


(60) 


'S'a = I d^a: - eaTZ{dTZf + ^ - 1 + j(61) 

- 2a^di9d‘^edin + y (^ - d^9d'^9) 


where the adiabatic speed of sound is given by 

2Md 


1 = 1 - 




and whercEE] 


d^9 = - 


d^n 




(62) 


(63) 


^^’Note that this would not be true if Cm depended on higher order derivatives of 0 (such as 
powers of 00 that enter non-redundantly) as would typically be the case for an effective action 
with terms suppressed by factors greater than 1/M^ where M defines the heavy mass scale that 
limits the validity of the theory m- 

note that does not contain any factors of the scale factor 
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is the scalar part of the solution to the shift vector to linear order = diO. We 
see in particular that one function- M^it)- appears in front of several operators in 
both the quadratic and cubic actions through the precise combination that is Cg (62|. 


The consequences of this simple fact for cosmological observables will occupy us in 
Sect. 3.1. 

Before closing this subsection, we wish to highlight the fact that although not 
immediately apparent, the cubic action (61) is exactly second order in the slow 
roll parameters as an operator expansior More specifically, each non-vanishing 


cubic contribution to the finite time correlators is suppressed by two powers of e 
and/or its derivatives. One can see this at the level of the action after sufficient 
integrations by parts [33 |H2]i or by explicit calculation of finite time correlators. 
This property persists to all orders- the EFT of the adiabatic mode is an expansion 
in powers of e and its derivatives, and is trivial in the limit e —>■ 0. It is in this precise 
sense that e can be considered as the order parameter associated with spontaneously 
breaking time translation invariance. 


2.4. Features and characteristic scales in B-nik) 

Analogous to the power spectrum ([^, the object of interest to calculate for cubic 
correlation function is the so called bispectrum Bn{ki, ^ 2 , fca), defined via 

(n-,^m-Jt)Tij:^it)) = ( 2 ^) 3 j ( 3 ) BnihMM). ( 64 ) 

In a statistically isotropic universe, the bispectrum is a function of the shape and 
the size (but not orientation) of the triangle formed out of the three component 
wavevectors, so that 

Bn{ki,k2,k^) = Bn{ki,k2,k-i) ( 65 ) 


Extracting such a quantity ont of the CMB is a very challenging endeavour, so 
it is convenient to further compress the information contained in the bispectrum 
through various diagnostics for which it is easier to construct efficient estimators. 
One such diagnostic is the dimensionless /nl parameter, defined for a hxed shape 
configuration (denoted by the superscript A) as 


f^i^{ki,k2,k3) 


10 {kik2kz)^Bn 

y (2^)47^!, E.fcf 


( 66 ) 


The origin of this diagnostic follows from the fact that from a variable TZq (x) that 
obeys Gaussian statistics, one can construct a variable TZjsaix) with non-Gaussian 


xxiiiHowever, when calculating correlation functions, one typically resorts to an expansion of the 
operator 7Z in terms of normal modes, which themselves contain powers of the slow roll parameters. 
Ref. |81| proposes a redefined operator basis that consolidates this implicit dependence in powers 
of slow roll parameters. 
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statistics of the ‘local’ typj’' 
as TZg{x)) through 


(which has the same expectation value and variance 


'^NG(a:) = Ti-dx) 


^/nl {T^cd) ~ {T^od))) 


(67) 


where the factor 3/5 is convention and follows from the fact that during matter 
domination, the Newtonian potential $ and TZ relate as 77. = Such a variable 
generates a non-zero bispectrum such that in the configuration ki Q,k 2 ~ d, the 
quantity on the right hand side of (66) will evaluate to We could also consider 

k2 


(66) for the configuration where ki 


fca, which is particularly sensitive to 


non-Gaussianities of the so-called equilateral type. This diagnostic is most sensitive 
to non-Gaussianities generated by three-point interactions between (non-collinear) 
modes that exit the horizon at roughly the same time, such as that which would 
be generated by reduced adiabatic sound speeds. The reason for this is that the 
former enhances (spatial) derivative interactions, which are suppressed on super¬ 
horizon scales, and therefore contribute the most for modes that cross the horizon 
simultaneously (i.e. in the equilateral configuration). 

How then, does one generate non-Gaussianities in the first place? For economy 
of discussion, we will restrict the following discussion to the action (59) defined only 
by the independent functions e, , M^^'' It should be clear that time variations 
in M^t) generate time variations in the resulting quadratic (60), cubic (61) (and all 
higher order) contributions to the action whereas M^t) only generates contributions 
to the cubic (and higher order) terms in the action. Therefore any features in the 
power spectrum induced by time variation of (i.e. through time variations of 
Cg (62)), would induce correlated features in the bispectrum in a manner than can 
be calculated, as we do in the following section. From our previous discussion, it 
should be clear that the expectation value (64) is a finite time correlation function, 
most conveniently evaluated in the in-in formalism. From (A.4) in Appendix A this 
is given by 

rt 




k2 ' 


/C3 ' 


( 68 ) 


where Hi{t) denotes the interaction Hamiltonian constructed from (61) given the 
free action (60), where in spite of the presence of derivative interactions, the in¬ 
teraction Hamiltonian is given in terms of the interaction Lagrangian simply as 
Hi = —Li (see [44] for a discussion of this subtlety). As discussed earlier, al¬ 
though not immediately evident from (61), the cubic action is in fact suppressed 
by one more order in slow roll parameter (i.e. e and its derivatives) relative to the 


‘Local’ here refers to the fact that the non-Gaussianities are mostly due to large deviations of 
the nature of the fluctuations at a given point from that which would be expected for Gaussian 
statistics. The bispectrum in this case would peak for configurations where one of the wavevectors 
is very small. 

^^'"As we discuss in the following section, these include the leading order contributions obtained 
from integrating out heavy fields that couple to the inflaton. 
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quadratic action so that to leading order in slow roll, it is sufficient to consider only 


the first term (681 in the general expression A.4 As we shall see shortly, although 


it is straightforward (if involved) to evaluate how features generated in the power 


spectrum correlated with features in the bispectrum extracted from (68), there is a 
particular limit in which matters are greatly simplified. 

Since TZ represents the non-linear realization of time translation invariance on a 
cosmological spacetime, a non-trivial relationship must exist between the two and 
three point correlation functions (since the action of the symmetry mixes terms of 
different orders). Consider the bispectrum in the so-called squeezed limit, where 
ki —>■ 0,/c 2 ~ ^ 3 - Since ki represents a much longer wavelength mode than ^2 and 
^3 the long wavelength mode is indistinguishable from a rescaling of the background 
over which the other two modes represent perturbations as they exit the horizon. 
Therefore the result would be the same if long wavelength mode were simply factored 
out of the expectation value (now taken over the rescaled geometry) with small 
corrections that go as kf/k'^. Consequently, it can be shown that the three point 
function relates to the power spectrum in a precise manner: 


lim Bn{ki,k 2 ,k 3 ) 

/ ci —>-0 


- Vn{.ki)Vn[k^) 


dlogVnjkz) 
d log k'i 


(69) 

] and gen- 
From the 

presence of the logarithmic derivative of the last factor in (|69|) , we see that any lo- 


which is the so-called single field consistency relation first derived in 
eralized through the ‘background wave’ argument given above in [S 


calized features in Vn directly sources features at commensurate wavelengths in B-ji 
in a simple manner in the squeezed limit. Although more involved computationally, 
later sections also will consider correlated features generated for more general shape 
configurations. 

Having equipped ourselves with the requisite formalism, we now proceed to 
survey the diverse theoretical motivations for the appearance of features and new 
characteristic scales in cosmological observables. Our goal is to highlight those mech¬ 
anisms for which a concrete rationale exists from the perspective of fundamental 
theory and to emphasize the role their detection might play in learning about the 
true nature of the degree(s) of freedom responsible for generating primordial struc¬ 
ture. 


3. Theoretical motivations 

After decades of interplay between theory and observation, in cosmology as in parti¬ 
cle physics, we have successfully arrived at a phenomenological model that accounts 
for most observations of the large scale structure of the universe. We’ve dubbed this 
the "Standard Model” of cosmology, which purports that the universe began in a 
phase of primordial inflation that eventually resulted in a hot, thermalized big bang 
parametrized by a A-CDM cosmology. However, similar to the situation in particle 
physics, we are left searching for any firm indications as to the deeper theory that 
















June 22, 2015 


26 

explains the seemingly arbitrary parameters of the cosmological standard model. It 
is in this context that the search for features, if present, can be loosely likened to 
the search for physics beyond the standard model in particle physics - searching for 
’’bumps” in certain primordial correlators that might betray the existence of new 
physics through the new characteristic scales that they imply, which in some cases 
can even be traced to the existence of very massive particles that couple to the 
inflaton (see following subsection). The positive detection of any such features and 
the concomitant characteristic scales that are implied by them would in principle, 
allow us open the door further ajar on the physics underpinning the A-CDM model. 

Having oriented ourselves with an introduction to the physics of the adiabatic 
mode, we now turn to a review the diverse theoretical mechanisms that naturally 
generate features in the primordial correlators in particular realizations of inflation. 


3 . 1 . The influence of heavy fields 


Scalar fields appear to proliferate in any attempt to reconcile cosmology with fun¬ 
damental physics. In many string and super gravity theories, scalar moduli seem to 
create as many problems as they do potential solutions [84l|85] for the simple reason 
that a typical low energy description contains many of them that are massless. In 
order to obtain a low energy description that is consistent with the observed absence 
of any gravitational strength long range fifth forces, one is required to give these 
scalar fields large masses to avoid spoiling the predictions of big bang cosmology, 
as well as to render them consistent with current bounds on hfth force experiments 

one might consider 


ilHZ]- Once one identihes a mechanism for this to occuir 


looking for variants that leaves one, or more of these fields light enough to be a vi¬ 
able candidate for the inflaton. From this viewpoint, one typically expects that the 
inflaton is a (or possibly one of many) light degree of freedom in a multi-dimensional 
field space. 

Having multiple light fields participate in the dynamics of inflation represents an 
important universality class of models, known simply as multi-held inhation |931I94) . 
In this section, we consider a particular limit of these models - the effectively single 
held limit - that represents the likeliest realization of all single held models when 
we consider inhation’s embedding in a UV complete theory. That is, we consider 
the inhaton to be the lightest held in a multi-dimensional held space where all other 
helds have masses that are much greater than the scale of inhatior [’^"*[ 

The standard lore would be that if the helds that couple to the inhaton are heavy 
enough, a truncation approximation suffices and we can simply ignore the effects 


of the heavy helds (dehned as in footnote xxviil. However, this fails to be true 
in many interesting contexts [53 IM] because of the usual adage that truncating 


’‘’‘''‘See for instance ref. l88tl90l in the context of ‘flux compactiflcations’ in type II string theory, 
and refs. 19111921 for a complimentary approach in the context of Heterotic string theory. 

’‘xvi'That is to say, ^ H^, where M represents the lightest mass of any of the heavy fields 
coupled to the inflaton. 
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a degree of freedom is not always the same as integrating it out. When the field 
undergoes non-geodesic motion in field space (i.e. turns), if the radius of the turn 
starts to compare with the mass scale associated with the orthogonal (i.e. heavy) 
directions, heavy fields can become excited completely consistent with the fact that 
no heavy quanta are created EH El Ea E3 IMl- This is not too different from the 
physics of bob-sledding - a bob-sledder going around a bend in the track freely 
moves up and down the track without exciting any of the normal (i.e. vibrational) 
modes orthogonal to their trajectory. So it is possible for the inflaton to excite the 
heavy fields orthogonal to it as the trajectory bends in field space without exciting 
the (Wilsonian) fast modes, consistent with decoupling in the usual senseIt is 


possible to derive a single held effective description when the effective mass of the 
normal excitations to the trajectory satishes: 


i/2 < Ml; Ml = m2-02 


(70) 


and where 0 represents the instantaneous angular velocity of the background tra¬ 
jectory in held space. In this effective single held description, the net effect of the 
interactions of the heavy helds with the adiabatic mode are encapsulated (to leading 
order) in a reduced and potentially varying adiabatic speed of sound given by [M| 


1 


= 1 + 


40 ^ 




= 1-b 


402 


(71) 


where k is the radius of curvature of the background trajectory (pQ = k6, and its 
velocitj 


In the language of the EFT of the adiabatic mode, one can understand this as 
having generated the leading order operator (dg®®)^ after having integrated out the 
heavy degrees of freedom, with the coefficient 


M4 = 


«2M2^- 


(72) 


Higher order operators are generated, but these are typically subleading when the 
speed of sound remains close to unity. For example, it can be shown on general 
grounds |100j (and explicitly [98]) that the coefficient of the (dg™)^ term M| is 


related to as 


Mf 




(73) 


where the exact order unity constant of proportionality depends on the precise 
context. Hence, one can understand the effects of heavy fields that interact with the 


xxviiirpj^g simple reason for this is that on time dependent backgrounds, there is a misalignment 
in field space between heavy and light fields and the fast and slow modes of the theory as defined 
by the Hamiltonian |55lI8Q| . 

^^i^Which in a multi-field context is defined as := where are the field co-ordinates 

with indices raised and lowered by the sigma model metric. 
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inflaton in the effectively single field limit as inducing a reduced and varying speed 
of sound. Recalling that the speed of sound represents an independent parameter 


in the effective action for the adiabatic mode (601 (61), it shouldn’t be surprising 


that it is possible to effect this without violating slow roll m and without creating 
any heavy quanta [MIES], thus providing an EFT rationale for considering varying 
speeds of sound during single field slow roll inflation [SS] ll01Hl()4j . The effectively 
single field regime is valid so long as the parameters defining the background, namely 
e and Cs remain small in the former case and vary slowly enough in the latter. The 
precise condition for Cg can be phrased as jSH jSS] 

\cs\^M\l-c% (74) 

which serves to ensure that no heavy quanta are produced as inflation progressed 
The discussion so far has been restricted to the tree-level effective action. Although 
one can simply posit the UV completion of the theory to be such that reduced and 
varying speeds of sound result in the low energy EFT, this will require additional 
fine tuning unless we are guaranteed radiative stability in the presence of large 
derivative interactions, as is the case in contexts discussed in |l()7Hl()9j . 

One can of course, move away from the effectively single field regime, transiting 
through the ‘quasi-single field’ regime [110H113] (which we return to shortly) - where 
the mass of the heavy field approaches that of the Hubble scale during inflation - 
through to a regime where the heavy fields can be excited by sudden turns in the field 
trajectory (thus departing from adiabaticity) [10511114llll6j . or by being initially 
excited or to situations where slow roll is temporarily violated [11811120] (see 

m for bounds on excitations of heavy fields of any variety from recent data). 
However in the latter context, one has to take care that too rigorous violations of 
slow roll inflation not violate the validity of the single field effective description 
[T2^[mp^ In any of these cases, one has to account for the isocurvature modes 
that are generated and track their decay back into adiabatic perturbations, implying 
a significant model dependence to the generation of features unlike in the strictly 
adiabatic case, where due to the nature of the EFT expansion of the adiabatic mode, 
the features are governed at leading order by at most two functions e and Cg. Thus 
we restrict ourselves for the rest of the discussion in this section to this case. 


One immediately realizes from (60) and (61) that any features in the power 


spectrum will correlate with features in the higher point correlation functions as 
well. This relationship was made precise in ref. [124| in the context where inflation is 


xxxTjje condition | |74| | is equivalent to the requirement that in the parent theory (i.e. before 
integrating out the heavy degrees of freedom), the frequencies of the heavy quanta satisfy 
the usual adiabaticity condition <IC 1 , ensuring that none will be created if they weren’t 

present in the first place m- For a discussion of the conditions under which these heavy quanta 
do not align with the usual isocurvature quanta see refs |80lI1Q5| . See also ref. [T06] for a detailed 
discussion of integrating out the heavy quanta when they do align with the isocurvature directi ons . 
xxxiA situation that requiring that e and 77 remain bounded throughout in combination with \74\ 
ensures won’t be the case. 
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punctuated by transient drops in Cs encoding the effect of heavy fields 


When the 


features generated by transient drops in the speed of sound remain perturbatively 


small (in the sense that AV{k)/V 1 uniformly) the relationship (36) can be 
inverted to express the drop in the speed of sound in terms of the features generated 
in the power spectrurr|^™“|- 


Ci TT , 


k Vn 


(75) 


In a variety of contexts, the cubic action depends only on and the function Mg 
which itself depends on through (73) and an order unity coefficient that depends 


on the heavy physics that one has integrated out to arrive at the effective theory. 
Therefore it shouldn’t be surprising that changes to the bispectrum - defined in 


(64) - 


%^(t)^gjf)%^(t)) = (2^)3 j(3) BnikiMM, 


induced by the variation of Cs, can evidently be expressed entirely in terms of the 
features induced in the power spectrum. For example, when one heavy field has been 
integrated out in a two-field parent theory [98], we have M| = 3/4(1 — cJ^)M|, 
whence 

f 3fcifc2 r 1 A I ks d /AVk] 

2 ks l2k[ 2kJ Vtz ^ 4A:2dlog/t [ Vn J 

AVn 


AB-ji = 


{kik2ks)‘^ 


I kl + k2 + kl ^ 
4 kik2kz 
kik2 + ^ 2/03 4 


k r' - ^ 


'Pn 


ik) 


4 2 perm 

(76) 


2k 


kski d / AVn \ kik2k3 
d log A: V Vn ) 


f^Vn\ 


4fc2 dlogfc2 y Vn J 


k={ki+k2+k3)/2 


A commonly used diagnostic for non-Gaussianity is provided by the so called /nl 


parameters, defined for a particular shape configuration as ( 66 ) 


h i- {kik2k3fABn 

Af^i^{ki,k2,k3) = 


3 


(77) 


We thus understand the physical content of (76) as implying that any features in 
the primordial power spectrum generate correlated features, i.e. a scale dependence 
to the bispectrum that can be expressed as 


/i 


A 

NL 


A 


(fc) 


AV 


K 


4cf (A:) 


Vn 


f AV 


n 


\ Vn 


+ c^(fc) 


AV 


K 


Vn 


(78) 


turns in the inflaton trajectory in the parent theory. 
xxxiii\Ye note that this inversion is in general possible around any given background, and not just 
one corresponding to slow roll inflation. What is important is that the feature is generated by the 
variation of one parameter alone to make the inversion possible. 
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Fig. 4. /Nu/^max (Black lines) vs ^^/Amax (Blue lines) for the equilateral (right), 
folded (middle) and squeezed shapes (left) for rofc* = —11, c = 0.8 (top) and rofc* = —11, 
c = 1.5 (bottom) respectively for the ‘cosh’ drop in the speed of sound, given by = 

^ _ _ ^max _ 

COSh[c(T — To)] ‘ 


where primes denote logarithmic derivatives with respect to the comoving scale k. 
All model dependence and dependence of the particular shape configuration (but 
not its scale) are encoded in the cf"(fc) coefficients |124) . In particular, we note that 
the single field consistency relation [39l|83] implies that cf"(fc) —>■ 0 for the squeezed 
configuration, which is easily be verified to be the case in (761. In figure we 
illustrate the correlation between features in the power spectrum and the correlated 
features in the bispectrum for three shape configurations, induced by a transient 
drop in the adiabatic speed of sound. Although the /nl diagnostic defined in ([77]) 
is not optimized for oscillating features, we simply use it as a useful heuristic for 
the time being, postponing a discussion of more appropriate estimators for features 
in the bispectrum to Sect. One can readily generalize this analysis to higher 
order and the case where the slow roll parameters are taken to vary in addition 
Clearly, seeing any such correlation between features in the power 
spectrum and the bispectrum would offer compelling evidence of the nature of the 
inflaton as an effectively single, weakly coupled degree of freedom independent of the 
details of its UV completioij^^^ It should also be stressed that a positive detection 
of any such features would also permit a primitive speetroscopy of the parent theory 
in that the overall envelope of the features that can be read off from Fig. relate 
directly to the scales M and k given in Eq. (701 and (0. 


xxxivggg for a study of the analagous situation where the turns are so fast such that ( |74| | is 

no longer satisfied and adiabaticity is violated. 

^^^'"See ref. |128| for preliminary indications for such a correlation in first year Planck data. 
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In concluding this subsection we return to an important class of models - quasi¬ 
single field inflation - that sit at the limit of the scope of this review in that they 
interpolate between the single and multiple field regimes. Proposed in [1101111111129) . 
it was observed that unless forbidden by a shift symmetry, loop corrections generate 
masses of order ~ for all otherwise light helds during inflatioi j™^* | implying 
that the contribution of isocurvature modes during inflation also has to be factored 
into the computation of late time correlation functions of TZ, typically generating 
oscillatory features in the bispectrum but preserving scale invariance of the power 
spectrum (see |131j for a recent investigation into this class of models from a more 
formal point of view, and [132] for an investigation into its signatures for LSS). 
It was further suggested [13311134) that excited isocurvature modes during quasi¬ 
single field inflation could be used as independent clocks other than that provided by 
the background inflaton (since the setup is intrinsically non-adiabatic) to measure 
primordial expansion history. See |135) for the prospects of detecting these in LSS 
and CMB measurements. 


3 . 2 . Light particle content and particle production 


An important generalization of effectively single field models motivated by various 
string theoretic constructions [13611137) , are those for which the single field nature 
of the model is punctuated by regions in field space where new light degrees of 
freedom appear along the inflaton trajectory. This can happen in two ways - when 
the window in which these new degrees of freedom appear is small or large relative 
to a Hubble time. The former case is quantihed by when 


HA^ 


» 1 , 


(79) 


where is the range in field space where the additional massless degrees of freedom 
appear. The opposite case, corresponding to when the window is large relative to a 
Hubble time, is when the system essentially transits through a multifield regime - 
where the production of isocurvature modes is energetically favoured. We consider 
this regime to fall within the ambit of multi-held models (where we have to track 
the precise ways in which the isocurvature modes subsequently decay) and is not 
within the scope of this review, although it is straightforward though more involved 
to generalize the results to this case. 

When the inhaton traverses a region where the once massive degrees of freedom 
it couples to suddenly become massless, a burst of particle production takes place. 
This is most easily inferred from the expression for the frequencies associated with 
the energy eigenstates of the isocurvature quanta, which we label 'i/f, given by 


^l{k) = ((()), 


(80) 


xxxvi'pj^^g itself an interpretation in terms of thermal masses generated for all scalar fields 

with the identification (in natural units) of m ~ T = H/ {2tt) being the temperature of de Sitter 
space [130]. 
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where prime denotes derivative w.r.t. (j) and where is the effective mass |96] 

of the heavy field (whose perturbations are the isocurvature modes) which depends 
on expectation value of the inflaton fielc^^^^^ The adiabatic theorem states that 
were we to begin in a state where no heavy quanta are present, then provided 


^« 1 , 


(81) 


them no ip quanta are created through the time evolution of the background. This 
condition implies 


"S/) • 


( 82 ) 


Therefore the necessary condition for any heavy quanta to be produced is 
that their eigenfrequencies must change as inflation progresses, and that this 
change is either (i) fast enough (so that the numerator in the above becomes 
large), or (ii) that the background transits through a point where the heavy 
quanta become very light (so that the denominator becomes small). Exam¬ 
ples typically considered in the literature focus on the latter possibility, al¬ 
though the former remains an interesting avenue which to date hasn’t been 
as widely discussed in the literature. For this reason, we presently elaborate 
on a simple example that encompasses both cases in particular limits’ 


XXXVlll 


A prototypical potential that manifests 
a transient drop in the mass of the heavy 
field is illustrated in fig As the inflaton 
rolls through the point where the field ip 
becomes light, isocurvature quanta produc¬ 
tion becomes favored. Since these quanta 
are only transiently light, they decay back 
into the adiabatic quanta upon becoming 
massive again. This initial burst of isocur¬ 
vature production (followed by intermedi¬ 
ate rescatterings) and subsequent decay can 
occur either with, or without significant en¬ 
ergy transfer from the background inflaton 
field. If the backreaction is significant, the 



Fig. 5. (1 — Sech^[(/)//i]) 


xxxvii^e take note here that in general, the notion of ‘heavy’ and ‘light’ can differ from the naive 
basis of fields in which the action is expressed m- Here, and in all that follows, ‘heavy’ and ‘light’ 
distinguishes degrees of freedom whose contribution to the Hamiltonian are separated by a mass 
gap. This mass gap defines the cut-off of the low energy effective theory once the heavy fields have 
been integrated out. 

xxxviiiggg |122| for theoretical bounds placed on the amount of adiabaticity violation during infla¬ 
tion from the requirement that the underlying effective theory remain weakly coupled, a situation 
which we stay on the right side of in the examples to follow. 
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slow roll parameters characterising the background evolution can change and in 
some cases can even become transiently large - a scenario known as interrupted slow 
roll |138H141j . This effect superposes on to the effects of adiabatic-isocurvature scat¬ 
tering which occurs even when backreaction is not significant, provided only that 


the condition (81) is violated. In order to illustrate both these effects, we consider 


a system of two fields (j) (the inflaton) and '0 with a potential given by 

\l - A^sech^[(0- 0*)/^]j, 


(83) 


with Vinf{4>) being a potential that is capable of sustaining slow-roll. The mass 
term for the ip held is illustrated in hg for the case A = 1, 0* = 0. A simple 
approximation scheme when dealing with dynamical situations in an interacting 
quantum held theory is given by the so called mean held approximation, which in 
the current context simply replaces the classical held product ip'^ with the time 
dependent expectation value (0^) (suitably renormalized). Therefore the effective 
potential seen by the inhaton is given by 


V{cP) = yi„f(0) + M2(02),^ 1 - A2sech2[(0 - 0*)/^] 


(84) 


where the subscript on the expectation value denotes that in general, it is a func¬ 
tional of the background trajectory. In Appendix B we derive this correction to the 
effective potential from a functional approach and relate it to the usual Coleman- 
Weinberg correction in the limit where the backreaction on the inhaton trajectory 
is negligible. 


From (84) one can readily compute the correction to the slow roll parameters. 


and hence the background solution induced by the correction to the potential 

5V{(P) = M'^{ip‘^)Al - Xhech^iiP - 0*)//r]j. (85) 

With the dehnition e = 0o/(2MpjiJ^), we can straightforwardly infer after some 
manipulations that to leading order: 


A 9 

Se = 2e — . 


( 86 ) 


From ([^, we hnd that this results in modulations of the fnnction z = with 

Cg = 1. In terms of the effect on the perturbations (even though the system remains 
effectively single held throughout its evolution) it is informative to see how this is so 
in the regime we’re interested in from the perspective of the parent two-held theory. 


We begin with the generalization of the MS equations (19) to a two held system 

m 

{k"^ -^ A 2eH^ (3 — e) + 2'\/^V0]u^^ = 0, 


.. 2 " 


+ + 2eM^,H^Ry^ = 0, (87) 
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where := a5(j) and := where R is the Ricci scalar associated with 

the field space metric. We recall that in applying the mean field approximation 
in the above, we simply take expectation values of all powers of ip. Since {ip) = 0 
by requiring the background to be on the minimum of the potential only 


derivatives of (83) involving only no, or two derivatives of ip survive. Hence the 


mean field approximation translates (87) into: 


+ + 2ei7^(3 — e)]u^ — 0, 




+ ^eM^iH^Ry = 0, 


( 88 ) 


(89) 


Thus we see that the two perturbations effectively decouple, justifying the notion 
that this is an effectively single field model (through satisfying ( [7^ ) although like 
the examples from the previous section, not without the influence of the heavy fields 
present. Thus, we see that particle production influences the effective potential seen 
by the curvature perturbation through the correction to arising from the second 


term in (84). From (88), we find: 


+ 2ei7^(3- e)]u^ + a'^{ip'^)^f{po)v^ = 0, (90) 

where the e and /(</>) depend on the background solution po, and 

" 2A^M^ (1 — 3tanh^[((/)—(}ii*)/p,]) 


f{p) := 1 - A^sech [{p - 0*)/^] 




cosh^[(0 — p^)/p] 


(91) 


The last term in (90) can be interpreted as the correction induced by a tran¬ 


sient burst of particle production, which also modifies the background solution 
parametrized by e. Evidently, with the identification 

^ = ^_ag[y”f + 2eoi7o"(3-eo)] 

Zq Gq 

^ ^ + 2eH^{3 - e)] - a^{iP^)^f{Po) (92) 

One can infer the features generated by constructing the potential ( p^ . It is straight¬ 
forward to check that to leading order in slow roll parameters, this is simply 


W{t) = -a‘^{p^)^f{po), 


(93) 


where one assumes that slow roll hasn’t been interrupted (so that higher order 
corrections, neglected in the above, become relevant - a situation which we will 
consider in the next subsection). The detailed evaluation of ( [M| is presented in 
[Appendix B.3[ and we plot its effect on the power spectrum in figure We note 


xxxixjj^ general this is not a consistent assumption to make if the background trajectory is turning 
in field space m, however given ( |83| l, it can be shown that this is a consistent assumption to 
make even at the level of the effective action m- 
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k/Mpc ' k/Mpc ' 


Fig. 6. induced by the particle production for the potential (831, with representative 

parameters A := = H~^,X = 0.9, M = 10~^Mpi,e = 0.01 with 0 = </>» approxi¬ 

mately 4 and 3 e-folds (left and right, respectively) before the current horizon scale exited 
the Hubble radius during inflation. 


that in our derivation, the ijj quanta do not need to become exactly mass/es.^ at (/)*, 
as has typically been considered in the literature, only that the scale fj, be such that 
the adiabatic condition (82) be violated, which can happen for reasonable values 
consistent with slow roll and the existence of the appropriate hierarchies between the 
mass of the heavy quanta and H. Of course, particle production is parametrically 
enhanced when the if} quanta become exactly massless, as considered in |143H148] . 
whose results can be viewed as a particular case of the treatment elaborated upon 
here. 

An interesting generalization has also been considered in [IMIS20 where mo¬ 
tivated by string theoretic considerations, the effects of traversing multiple intervals 
in field space where massless fields appear along (or nearby) the inflaton trajectory 
was considered. In the limiting case where these regions appear frequently (while 
satisfying (79) throughout), a limiting velocity results for the inflaton due to contin¬ 
ual particle production as inflation progresses. The resulting effective potential for 
the inflaton is consequently ‘flattened’ even in cases where the tree level potential 
would not ordinarily sustain slow roll. This is but one example of potentially observ¬ 
able physical consequences the inflaton’s embedding in its (multiple field) parent 
theory. In the next subsection we review other such examples that can result in the 
appearance of new characteristic scales in cosmological observables. 


’''Which would correspond to the case A = 1 - see the figure in [Appendix B.3| for observationally 
relevant particle production for smaller values of A. 

’'*'See also for a related study in a fully multi-field context. 
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3 . 3 . Effective potential modulations and phase transitions 


Perhaps the most direct manner in which one could generate features in the power 
spectrum would be to explicitly modulate the part of the potential responsible for 
slow roll inflation itself. As we have seen in the previous section, although this can 
easily happen due to couplings of the inflaton to other fields at one loop (even those 
with masses far greater than the Hubble scale) one could also entertain such mod¬ 
ulations purely from a phenomenological perspective or motivated by a particular 
class of constructions. It is useful in this context to differentiate those potentials 
that cause localized ‘interruptions’ in slow roll (in a sense that will be made pre¬ 
cise) from those for which the slow roll parameters remain small and bounded. For 
this, we first need to distinguish the relative significance of the different slow roll 
parameters for the background evolution. Consider the definitions 


^ _ ^ ^ ^ 


(94) 


which are equivalent to the first three terms of the so-called ‘Hubble hierarchy’ 
[511 1154] of slow roll parameters. The term hierarchy follows from the fact that 


e = 2He{e - rj) (95) 

V = + r]e-0, 

and similarly for which would involve the next term in the hierarchy. In a precise 
sense e is at the top of the Hubble hierarchy in that the condition e < I determines 
the very existence of a background that is undergoing accelerated expansion. Al¬ 
though p being small is colloquially considered one of the requisites of ‘slow roll’ 
inflation, this is not strictly true as 77 ^ I merely guarantees a sufficiently long 
period of slow roll if it and e are both roughly constant (see for an illustration 
of a background where e is small and fast decreasing with p of order unity). In 
other words, e indicates whether inflation is happening at a given instant whereas 
p indicates whether inflation will continue to happen in the future if both slow roll 
parameters remain constant. As we go further down the Hubble hierarchy, it should 
be clear that the time variation of a term is partly dependent on the next order 
term. 

With these distinctions as to the significance of the terms of the Hubble hierarchy 
in mind, we define ‘interrupted’ slow-roll as realizations of inflation punctuated with 
periods where e ^ 1. That interrupted inflation should generate features in the 
primordial correlators shouldn’t come as a surprise given that by construction, it 
breaks time translation even more strongly than the quasi de Sitter background that 
gives rise to ‘almost’ scale invariant perturbations. The first instance of interrupted 
slow roll in the literature is the reference |156j , where it was realized that attempts to 
embed inflation in grand unified models could result in a series of phase transitions 
during inflation |I57j . generating an effective potential that has a series of breaks 
separating different phases of slow roll with differing Hubble rates (see |I58HI63] for 
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earlier proposals that did not necessarily invoke intermediate phase transitions to 
obtain ‘multiple inflation’, and |164j for a subsequent justification from the string 
perspective). Such models would predict primordial power spectra with sequential 
steps at those comoving scales that are exiting the horizon at the time of each phase 
transition, with oscillatory transients (see |lfi5l llbfij for early attempts to probe 
galaxy surveys in combination with pre-WMAP CMB observations for evidence 
of these). It has been argued that evidence for such features exists in subsequent 
CMB observations approaching three sigma significance [ml I140j . Interestingly, 
these transient violations of slow roll also allow for detectably large local non- 
Gaussianities |167) ~ 5 — 20) as implied by ([^, although these will require 


the implementation of estimators that are sensitive to scale dependent local non- 
Gaussianity, a topic which we will return to in the next section. The authors of 
[141] furthermore considered interrupted slow roll as a mechanism through which to 
suppress primordial power at large angular scales, which as a corollary also generates 
oscillatory features that might account for certain ‘outliers’ in the angular power 
spectrum 


Within the context of standard, uninterrupted inflation (where e remains small 
and uniformly bounded throughout) one can envisage situations where higher order 


terms in the Hubble hierarchy (e.g. ry, ^ in (95)) can become transiently larger. 


consequently generating features. In order to make quantitative predictions in this 
context, one has to account for corrections to the usual formulae for the spectral 
properties of the GMB that ordinarily assume the smallness of the higher terms in 
the Hubble hierarchy. This was first investigated in |173] , and has come to be known 
as the generalized slow roll (GSR) formalism. This formalism is necessary when 
features in the potential or the derivative couplings of the model (e.g. through non- 
canonical kinetic terms) themselves cause large jumps and deviations of the higher 
order Hubble slow roll parameters (unlike the examples studied in the previous 
two sub-sections), which might even be necessitated in certain multi-field set-ups 
m- Features from steps in the potential or in non-canonical kinetic terms that 
necessitate the use of the GSR formalism have been investigated in |174H177] for 
the power spectrum in addition to the bispectrum in |5TJ 11191 ll78H180j . The effects 
on the polarization power spectra (which as we shall see in the next section, can 
resolve features with finer resolution) was investigated in |18111182) . 

Among the more traditional genre of models where the slow roll parameters (and 
their derivatives) remain bounded and small throughout inflation, a particular class 
of models involving the use of axions has received a large amount of attention as 
means of realizing large field inflation in the context of grand unified theories and 
string theory. Whenever a theory admits one or more gauge symmetries {SU{S)c 
in the case of QGD for example), one must also a priori allow for the possibility of 


’‘'“In addition to generating potentially observable tensor non-Gaussianties over a limited range 
of scales [770111721 . 
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dimension four operators such as 




( 96 ) 


to enter the low energy effective Lagrangian. In the above, G))^, is the field strength 


associated with the gauge group with gauge coupling g, with G“j, its dual field 
strength, and 0 a constant parameter. Although classically (961 is a total derivative 
for constant 9, such a term violates CP invariance in the quantum theory unless 9 = 
0. In the context of QCD for example, the absence of an electric dipole moment for 
the neutron bounds 9 to be less than 10“^^ (see |183j for a general review of axions 
in cosmology, particularly as dark matter candidates). Quite why 9 is observed to be 
so small when it could ‘naturally’ be as large as order unity is known as the strong 
CP problem. Peccei and Quinn |I841 I185j proposed a dynamical solution to this 
problem by introducing a new spontaneously broken 17(1) symmetry, where the 
associated (pseudo) Nambu-Goldstone boson (j) couples to the CP violating term 
through the dimension five interaction 

(97) 


^ 9 ay ^ n 

7 ^ ’ 


where / is the so-called axion decay constant. The action for the nominally CP 
violating sector then becomes 




/^a /^a fiu ^ n 

32^2 CL. 


(98) 


By a well known result non-perturbative result |186) . instanton effects then induce 
an effective potential for the axion of the form 


V{(j)) =—A‘^ cos ^9 — (99) 

where A is a non-perturbatively generated mass scale. Such a potential has 4> = f9 
as a minimum, cancelling the CP violating term completely, and thus solving the 
strong CP problem. Clearly (j) ^ (p + c \s a. classical shift symmetry of the theory 
(broken only by non-perturbative effects to a discrete subgroup) since Tr G^^yG^'" is 
a purely topological term, and so it appears natural to entertain p as a candidate 
infiator (^**“[ An immediate problem in doing so is that in order to satisfy the slow 
roll conditions, one requires the axion decay constant to be such that 


/ » Mpi (100) 

which is hard to construct in a controlled setting (e.g. string theory [Ml) in addition 
to being suspect as a hierarchy at the quantum level. This motivated the authors of 
[188] to consider multiple axion fields with decay constants that are nearly (though 
not precisely) identical. In this case, the multi-field potential admits a nearly fiat 


’‘'“‘Which through expanding j99| around its minimum, can also be viewed as offering a concrete 
realization of inflation. 
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direction in the eigenbasis of the mass matrix for decay constants /i ^ f 2 , and that 
these can readily be sub-Planckian 

An entirely different approach for obtaining inflation through axions was pro¬ 
posed by the authors of m- In the context of type IIB string theory, compactifying 
extra dimensions using fluxes generated by 2-forms (see [SHI ^ review) introduces 
axions to the 4-d effective theory. The presence of (NS5) branes wrapping around 
2-cycles of the compact (Calabi-Yau) sub-manifold explicitly breaks the shift sym¬ 


metry of the corresponding axion and introduces the linear term to (99): 


V(4>) = cos { 9 — y 


( 101 ) 


with fi a mass scale derived from the details of the flux compactification, the string 
coupling and the brane tension. It is the linear part of the potential that is respon¬ 
sible for inflation, with the cosine representing small, superposed periodic modu¬ 
lations. For large enough field values, the slow roll conditions are satisfied and a 
sizeable tensor mode background is generated. In addition, the non-perturbatively 
generated part of the potential induces periodic features in the power spectrum as 
inflation progresses UMOraiQ which gives rise to a novel mechanism through 
which sizeable non-Gaussianities can be generated in single field inflation, sourced 
by a resonance between the perturbations and the oscillations of the background 
inflatorj^l^ Resonant non-Gaussianity, first observed in |199] and subsequently in 
the context brane pnn| and axion monodromy inflation is in fact a gen¬ 

eral feature of any model where a smooth slow-rolling potential is modulated by 
sinusoidal oscillations (cf. |203| for a general study of signatures of discrete shift 
symmetries during inflation, and PM] for studies of resonance in a multi-field con¬ 
text). As was first noticed in |205j . the equation of motion for the mode functions 
of TZ that follow from Q 

it+(sH+ --2—']n-%v^n = o ( 102 ) 

V e Cs J a-^ 


can transiently exhibit parametric resonance for a range of modes (that fall within 
bands where the so called Floquet index is briefly positive [206j l if either e or Cs 
vary sinusoidally in time. In plainer terms, oscillating background quantities can 
generate resonant particle prodnction of TZ quanta over the brief window where the 
physical frequency of the mode Wphys = Csk/a is commensurate with that of the 
oscillating background quantity. For the model defined by the potential (101), we 


xhvTJie authors of 11911 noted that this is a special case of a more general ‘alignment’ mechanism. 
’‘'''See 11951 for a criticism of this construction for having neglected the effects of brane-antibrane 
backreaction which induces corrections that spoil the conditions for inflation. See however CSS] 
for a construction where these corrections are under control (097] attempts to sidestep this issue 
completely by generating the linear term from an F-term rather than brane monodromy). 

addition, in certain other regimes sizeable equilateral non-Gaussianities can be generated 
even in the absence of sizeable features generated in the power spectrum USE]. 
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presume Cg = 1 and that the evolution is monotonic, which is guaranteed if 

A 1 


( 103 ) 


It can be shown [19311201j that to leading order, superposed oscillatory features 
(logarithmic in k) are generated of the form 


AVti 

'Pn 


(k) = KCOS 


4>t iog(fc/fc,) 


/ 


/ 


(104) 


where denotes the value of the background (jilt) when the comoving scale fc* exits 
the horizon, and where we have defined 

« := 36 (27r/)i/2, / := (105) 

Presumably, terms subleading in slow roll neglected in the treatment of [19311201] 
will also cause Ug to decrease logarithmically in fc, as required by the discussion 


following (36). We note in passing that it was observed in |207| that long wavelength 
features of the sort that can occur in axion monodromy models can alleviate the 
tension with the observed lack of power at large angular scales. 

Similarly, the bispectrum can be calculated as 


Bn{ki,k2,ks) = 


(27r)^A^(/c*) K 
(fci/cafca)^ 8/2 


Sin 


^og{K/k^ 


ki f\og{K/kt) 


(106) 

where A^(A;*) is the amplitude of the power spectrum at the pivot scale k^ usually 


corresponding to COBE normalization. The expression (106) represents the leading 


order expression in slow roll and the dimensionless parameter / defined in (1051, 


which according to (106) has to be small if large enough non-Gaussianity is to be 


observed. The second term in the above is consequently suppressed relative to the 


first, except in the squeezed limit (691 where its contribution ensures that the con¬ 


sistency relation is satisfied. As has been noted in [201] although the overlap; 


xlvii 


between (1061 and the standard squeezed, equilateral and orthogonal shape con¬ 


figurations are small, significant non-Gaussianity is present in other configurations, 
calling for more refined estimators for this genre of bispectra |208| . We plot various 
profiles of the bispectrum in the figure below. 

Searches for oscillatory features in the CMB two and three point functions have 
been extensive with no significant evidence in favor of them. It has 

however been argued that the use of improved estimators is required to be really 
sensitive to the bispectra generated by resonance models |208| . In addition, one 
can expect improved sensitivities to result from cross correlating with near term 
LSS surveys [HUl 121)9] . the gain in sensitivity coming from the expected gain in the 
number of modes observable at commensurate comoving scales. We shall return to 


shall discuss what we mean by overlaps between different shape configurations in more 
precise terms in Sect.j4| 
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— 

^ I 



Fig. 7. Left plot: AV-jzlV vs. for k = 0.05, / = 0.02, and similarly for the right plot: 

/i 32 , fca) , 4 ^?, . 1 where we fix ki = k* = 0.002Mpc“^ and where we define X 2 '■= k 2 /ki 
and 13 := fca/fci. The restriction of the domain follows from the triangle inequality 1 < X 2 + X 3 . 


the important question of improved sensitivity to features in the primordial two and 
three point functions through future LSS and 21 cm surveys in the next section. 


3 . 4 . Pre-inflationary dynamics 

One of inflation’s key motivations is that it purportedly accounts for the initial 
conditions of the hot big bang with few traces of the pre-inflationary state from 
which it emerged. That is, given a sufliciently long period of inflation, there will be 
no memory of the initial density matrix even if it corresponded to a state vastly 
different from the homogeneous, isotropic, thermalized initial plasma of the hot 
big ban^y™ This feature was initially coveted for its help in rendering models 


of grand unification (which generically overproduce cosmologically dangerous relics 
such as monopoles) consistent with observations, provided the reheat temperature 
was below the scale of any dangerous phase transitions. Some have even gone so 
far as to argue (in the context of chaotic, self reproducing inflation) that any given 
observer is ‘likeliest’ to find themselves in a universe that inflated for a very large 


number of e-folds, rendering the qnestion of the initial state of the universe moo1 


However, as another illustration of a recurring theme of this review, these con¬ 
siderations take for granted the manner in which inflation embeds in some candidate 
parent theory. In the context of string compactifications for example |85j it is rather 
delicate to arrange for potential that can sustain enough inflation (given arbitrary 
initial conditions) without overshooting the region where inflation is possible |216) . 
From the effective field theory perspective, it is only a restricted class of models that 


xlviiiAlthough it has been argued that inflation requires homogeneity on scales larger than the 
horizon at the time of its onset, thus assuming one of the conditions one might seek to explain 
with it I210I (see also l211H21.'il i. 

xhxcf I214I see I215I for a criticism of this reasoning. In general, all such arguments depend on a 
meaningful addressing of the so called measure problem of cosmology and should be taken with 
the appropriate disclaimers. 
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can sustain enough inflation (i.e. undergo enough excursion in field space without 
encountering regions of the effective potential that spoil inflation) [5^, the most 
common manifestation of which is the so-called eta problem (see Em for a review). 
Such considerations have led some authors to argue that if it is so hard to arrange 
for enough inflation to occur, perhaps it makes sense to conclude that it didn’t last 
much longer than is strictly necessary to solve the horizon problem |218H220] . A 
generic consequence of this, is that we’ll be witness to the dynamics of the inflaton 
close to its onset in addition to the density matrix of the universe at that epocl|^ 
It has been noted multiple times, in differing contexts that a generic suppression 
of power results for the scales that are exiting the horizon at the time of any pre¬ 
inflationary dynamics where slow roll has yet to be attained. This can be readily 
appreciated through the expression for the tilt of the spectrum for a solution that 


has settled onto an inflationary attractor (18): 


Us — 1 = 2r] — 4e. 

Recalling the definitions 77 := —cj)/{Hep) and e = one can readily show that 


(95): 


e = 2i/e(e — rj) 


(107) 


rj = H{r,^ + r,e-0; 


(108) 


so that to first order in the slow roll parameters, one can get a good first estimate of 
the envelope of the power spectrum (that is, its overall shape neglecting oscillations) 


by simply replacing in (18) the values of the slow roll parameters at the time tk at 


which the comoving scale k is exiting the horizon: 

nflk) - 1 « 2r]{tk) - 4e(4) -h ... 


(109) 


To see how any pre-inflationary (fast-roll) dynamics can suppress power for the 
scales which are exiting the horizon, we note that for decelerating solutions in an 
expanding universe, t] > 0. Furthermore we note that although e is by definition 
positive, rj can transiently be large (i.e. much larger than e) without spoiling slow 
roll if e is small enougtjj^ Therefore a background solution that is decelerating fast 
enough (but is still within the slow roll regime) ( |109 ) implies a positive tilt to the 
spectrum for the modes exiting the horizon at that time, eventually settling onto 
an attractor spectrum that characterizes the attractor, favored to be slightly red 
by observations. This implies a suppression of power at longer comoving scales. 
Any spectrum with a positive tilt that settles onto a flat, or red tilt for shorter 
comoving scales (which exit the horizon once the background has settled onto its 
attractor) necessarily has its power suppressed at longer wavelengths relative to 


'That is, any initial particle content and correlations before a long period of inflation dilutes them 
away. _ 


^This can be inferred from the formulae (1071 and (108 
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shorter wavelengths. For the 
that conversely the envelope 
wavelengths (up until scales 
necessary and this first order 
illustrative example in fig. 


tensor modes, one finds that nrik) = —2e(tk), so 
of the spectrunf^ is going to be enhanced at long 
at which higher order corrections to (109) become 
formula can no longer be trusted |222j l. We plot an 



Fig. 8. The running of the spectral indices and the exact power spectrum for motion in the 
potential , which represents a potential that can sustain slow roll 

augmented with a contribution that results in fast roll if the field is far enough to the right, where 
we begin tracking the evolution (taken from a larger class of examples studied in [223]). Exactly 
which of these modes we observe depends on the relative normalization of M w.r.t. A:*, the scale 
at which we COBE normalize. 


We stress that although the primordial power spectrum is suppressed at the largest 
scales, exactly which part of it we are privy to through our observations depends 
on when the comoving scale fcp corresponding to the present day Hubble horizon 
Hq had exited the horizon during inflation. This is because the window functions 
that project the spatial variation of TZ onto the temperature anisotropies on the 
surface of last scattering (depicted in fig. Ill peak at a scale whose normalization 
relative to the modes exiting the horizon during this pre-inflationary dynamics 
could in principle, be arbitrary. That is, were we to observe the effects of any pre- 


*“If it is ever measurable — see m\ for cosmic variance limits on the latter that depend on the 
eventual tensor to scalar ratio once primordial gravity waves are positively detected. 
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inflationary dynamics, we would need this phase to have occurred within some small 
window (typically 6-8 e-folds) of the exit the comoving scale corresponding to k^. 
This has to be conceded as a tuning, or at the very least, another cosmological 
coincidence although the latter has some motivation in the ‘just enough’ inflation 
scenario. We also wish to stress that the only observable trace of a modification 
to the power spectrum at the largest scales typified by the example of fig. would 
be in the suppression of the coefficients Cg of the angular power spectrum. Any 
oscillatory features are typically too rapid to be resolved by the broadness of the 


window function at such large scales (see fig. 111. 

Following up on an observation first made by the authors of |224) . pre¬ 
inflationary ‘fast-roll’ has been considered in various studies [22511226j as a means 
to account for the statistically marginally low quadrupole seemingly apparent in the 
WMAP data. Motivated by related (but statistically much more significant) large 
angle anomalies in the CMB [22711229]^ the authors of |218H220l 1231] argued for 
a scenario where inflation ought to have lasted not much more than necessary and 
again studied the resulting suppression at large angular scales. 

Pre-inflationary fast roll has been justified from a variety of microphysical con¬ 
siderations that include the preference for inflation to occur around inflection points 
in a single-field context in |232| and in a string theoretic (multli-field) context in 
[isg. Notably, the authors of [234] observed that the sort of potentials generated 
when supersymmetry is partly broken by the presence of D-branes and ortientifold 
planes in type II string theory, necessarily imply that in order to connect a late 
time power low inflating phase with the initial curvature singularity of the big 
bang, the scalar held must emerge from the big bang climbing up the potentia 
Subsequent investigations [22311235H237] have shown that such a pre-inflationary 
‘climbing’ phase has a qualitatively much more pronounced effect on suppressing 
power at the largest scales than monotonic relaxation to slow roll from fast roll. 
Further motivation for a phase of pre-inflationary fast-roll was provided by the au¬ 
thors of [Miiisn], who argued that the generic initial conditions for a rolling scalar 
field after it nucleates from a false vacuum generically suppress power at large scales 
for entirely analogous reasonj^ 


3 . 5 . Initial state effects? 

Having focussed on pre-inflationary dynamics in the previous subsection, it seems 
only consistent to also consider pre-inflationary initial conditions that differ from the 
usual Bunch-Davies (BD) vacuum state. Before we discuss the various theoretical 


““See [230] for forecast gains in the statistical significance of the large angle anomalies from CMB 
polarization measurements. 

hvThus the authors of [234] provide an answer to the question (often taken for granted) as to why 
the inflaton begins up the potential in the first place. 

i^See \2M\MT\ also for a study of infiationary dynamics in the context of the mini-superspace 
approximation to the Wheeler-DeWitt equation where large scale power is similarly suppressed. 
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motivations for doing so, we first address several important theoretical restrictions 
on modifying initial states that arise from considerations of obtaining a consistent 
perturbative quantization. We begin by hrst recalling the expression for the power 
spectrum ([^ with an arbitrary initial state 


27r^S\k + q)Vn,^,{k) := 


( 110 ) 


Were we recall that were j'i/;/) to correspond to a so-called ‘random phase’ state 
(such as an eigenstate of the number operator of the curvature quanta or a thermal 


state cf. the discussion in footnote xv), the result would simply be 


Vn,^Ak)=Vnfi{k)[l + Mk)] , 


( 111 ) 


where n{k) corresponds to the occupation number of the k*^ mode and where 'P'h.q 
corresponds to the power spectrum obtained from the usual BD vacuum. In gen¬ 
eral, any state which can be obtained from the BD vacuum by a mode by mode 


transformation (40) 


sinh6lg7^t 


^S = cosh0g% 

TZ'^ = cosh dg -I- sinh 6^ TZ^ , 


( 112 ) 


corresponds to a rotation of the creation and annihilation operators into the new 
canonical pair: 


= cosh6»g% 


sinh 07: 


bt = sinh0j:d_j: -I- coshdgdt , 

which are given in terms of the familiar Bogoliubov coefficients 
ag = cosh0j: , /Jg = sinhd^ . 


(113) 


(114) 


The result is the generalization of (111) to states with potentially correlated 
phasej^ 


rnAk)='PnAk){l + m, 


where we define the phase A^; via TZ^ = e 
on the power spectrum of the form 

AVn 


„ , 2cos(<5g + 2A,-) |ag/3,H} , (115) 

generating superimposed features 




ik)=m\ 


+ 2cos((5j: -1- 2Ag)|a^/3^:| 


(116) 


which represents an expression that is without loss of generality (by the complete¬ 
ness of Hilbert space), the precise details of the initial state and its phase correla¬ 
tions being encoded in the various coefficients 
caveats are in order here. 


Ivii 


in (115). However, some important 


the passage between eqs. and ( |49| l for more complete details of the derivation. 
^'^^^Given a concrete scenario for pre-inflationary dynamics, these can be explicitly calculated. 
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The first caveat is a well known feature of quantum field theory. We first note 


that the formal transformation that effects the mode by mode rotation (|113) that 

(117) 


describes the modified vacuum state |V'/) is given by (43) 


1 


(7(0) = Exp^-- / d^k [07%a_, 




, a-a -I 






So that 


IV'/) = C7(0)|O_bd), 


(118) 


where the subscript BD is to emphasize that the original vacuum is the usual 
Bunch-Davies vacuum state. One is entitled to ask what the overlap of the modified 
vacuum state is with the original vacuum, given by 

(V'/|0bd) = (O_BD|t/(0)|O_BD) (119) 

Requiring that the modified vacuum states be invariant under time reversal imposes 
the condition % = 0 [50], so that Og = 0^. One can then show that 

IV'/) = Exp 


Iviii 


— - / d^fcln cosh0^ 


Exp 


- [ d^k tanh^rat 
2 / fc 


|0 


BD 


( 120 ) 


so that the overlap between the modified vacuum state and the original BD vacuum 
is given by 

1 


(V'/|o 


BD, 


= Exp 


2 j d^/cln cosh^j: 


( 121 ) 


It is also straightforward to show that the state (120) has unit norm provided that 


d^fcln cosh0^ < oo, 


( 122 ) 


otherwise it would not be possible to describe |V’/) as a normalizable excitation 
over the vacuum \0bd)- This would imply that |V’/) exists in a different (unitarily 
inequivalent) Fock space that the original Bunch-Davies vacuum. 

The class of vacua that remain invariant under action of the de Sitter symme¬ 
try group are parametrized by the mode by mode constant rotation 9^ = a where 
a is some constant. These are the so-called ‘alpha-vacua ’ m which have been 
entertained as plausible modifications to the usual BD vacuum for inflationary cos¬ 


mology. However, it is immediately apparent that the bound (122) is violated for 


any constant rotation, implying that the differing a-vacua are unitarily inequiva¬ 
lent, wherein a serves as a super-selection parameter between them [243] . So far 
this is merely a formal statement, and one might wonder why the BD vacuum might 
be preferred at all rather than any other a-vacuum around which one can pertur- 
batively quantize the fluctuations of the inflaton? It turns out that an interacting 
field theory defined over a generic a-vacuum ground state is intrinsically non-local 


'"“See [242], in particular eqs. (2.39) - (2.39). 
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in that one generates divergences that require counter-terms involving products of 
local fields with fields at the antipodal point in de Sitter space*™ |244| . The only 


vacuum for which this is not true is that state that is invariant under the antipodal 
map, which corresponds to a = 0, or usual Bunch-Davies state. 

Therefore, a reasonable restriction on the class of modified initial states appear 
to be those for which the bound (122) is satisfied, which describes an initial state 
corresponding to a finite energy excitations over the BD vacuum. A corollary of (122) 
is that any features generated by modified initial states must have finite support in 
fc-space (i.e. the modulations to the power spectrum (49) must eventually decay). 
Physically, this is because there can only be a finite number of excited quanta up 
to a finite energy. Thought of another way, inflation by its very nature dilutes the 
universe and works to erase any memories of the pre-inflationary initial state by 
construction. Accordingly, any modified initial state (corresponding to the initial 
presence of particles with a range of momenta) will see these particles exponentially 
diluted and can only imprint in late time observables if inflation did not last too 
long (cf. the previous section), or if quanta of high enough momenta were initially 
present. The presence of the latter, clearly being bounded by the fact that one would 
like to remain in a regime where one is describing perturbative excitations around 
some slow-rolling backgrouncQ 

With these restrictions in mind, we can survey the various theoretical moti¬ 
vations for considering modified initial states. The question rose to prominence 
primarily in the context of the so-called ‘trans-Planckian’ problem of inflationary 
cosmology |249[ I250j . It was noted there that if one were to extrapolate the physical 
scales that we observe in the CMB back through 60 e-folds worth of expansion, 
a large fraction of the modes observed today would have once been in the super- 
Planckian regime. It then seems a relevant question to ask, could these modes be 
sensitive to physics above the Planck scale? After a spirited discussion in the liter¬ 
ature how best to answer this question crystallized around the issue of 

how one chooses to parametrize our ignorance of the evolution of the fluctuation 
modes the trans-Planckian regime 


Ixi 


One approach (borrowing from analogous situations in condensed matter sys¬ 
tems) would be to model evolution in the regime where unknown physics completes 
the description of the dynamics with modified dispersion relations that encodes 
this physics. The motivation is straightforward - massless quanta in a continuous 


''’'Recalling that the usual FRW coordinate system used to describe de Sitter space only covers 
half of the spacetime manifold (see CS] for a discussion of various aspects of physics on de Sitter 
spacetimes). 

^^This has frequently been phrased as requiring that the modified initial states do not ‘back-react’ 
substantially on the background geometry to spoil its quasi de Sitter nature 

analogue of this question had previously appeared in the context of Hawking radiation, where 
an observer at asymptotic infinity will observe photons from the tail of the black body distribution 
that will have been redshifted from initially trans-Planckian values. In this context, thermalization 
proves to be efficient in erasing any traces of non-adiabatic evolution in the trans-Planckian regime 
[252] (modelled by modified dispersion relations) USD. 
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^phys(^) ^phys(^) 




Fig. 9. Examples of phenomenological modified dispersion relations that violate the adiabatic 
approximation above fcc (left) and those that don’t (right). All wave vectors are in physical units 
^phys := fc/ci, where k is the comoving wave vector. 


medium obey the linear dispersion relation 

uj^{k) = (123) 

whereas once modes start to approach the scale of where the continuum description 
stars to break down (such as approaching the inter site separation in the context of 
a lattice) the linear relationship acquires a convexity, 

w^(fc) = (? sin^ (k/kc)- (124) 


where kc denotes the characteristic inverse wavelength associated with the micro¬ 
physics of the system (e.g. a lattice spacing). Therefore one might phenomenolog¬ 
ically consider different dispersion relations to see what their effects might be on 
CMB observables. 

It turns out, perhaps unsurprisingly, that any modified dispersion relations that 
do not violate adiabiticty (defined as ^ 1) in the super-Planckian regime 

renders the modes as they emerge into the sub-Planckian regime in their usual 
adiabatic (BD) vacuum state [2511125211255) . and thus do not affect CMB observables 
in any significant waj Specifically, modulated corrections to the power spectrum 


appear but are suppressed by a factor of where Me is the characteristic 

scale of the trans-Planckian physics, and is thus most likely to remain unobservably 
small. However dispersion relations that violate adiabaticity in the trans-Planckian 
regime can result in calculable (logarithmic) oscillations in the power spectrum. One 
conceivable manner in which the latter is possible is for dispersion relations of the 
type illustrated in the left hand plot of fig. which requires a concave regime for 
the dispersion relation |265) (at horizon crossing, fcphys = H so that adiabaticity is 
typically violated at a; « H). However, absent any concrete model where this occurs 
in a controlled manner, one must take such dispersion relations with a grain of salt as 


ixiiTjjg arguments presented in these references trace the standard logic for when one can construct 
the adiabatic vacuum on a time dependent background, the details for which can be found in m- 
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k/Mpc“^ 


Fig. 10. Features generated by a model where H{ko)/Mc ~ 0(10 ^), with kg = 10 ^Mpc ^ 
[256]. 


they typically signal a transition among the propagating quanta rather than a bona 
fide modification to the dispersion relation in itsel: *’'‘“ A general context in which 


modified dispersion relations finds theoretical grounding is in the of high energy 
Lorentz invariance violation, either considered phenomenologically |267H269] or in 
the context of a specific construction (such as Hofava gravity [270H273j l. Related 
contexts where short distance modifications to mode propagation lead to Lorentz 
invariance violation as a corollary includes non-commutative geometry |274H28^ 
and inflation in the presence of a minimal length scale 

Another approach one could take to parametrize our ignorance of how modes 
propagate in the trans-Planckian regime appeals to the tried and tested workhorse 
of effective field theory |253H2551125811260] . Ordinarily, one might expect the effects 
of any physics beyond the cutoff of our description to be encoded in terms of higher 
dimensional operators expressed in the usual derivative expansion [2531125411258] , 
seemingly drawing into question the need for any novel formalism. However, this 
ends up shuffling the issue from modifying the dynamics of mode propagation into 
the initial conditions, wherein the trans-Planckian question is now phrased in terms 
of requiring a specification of initial conditions with super-Planckian resolution. An 
EFT approach that seeks to address this invokes the so-called boundary effective 
field theory formalism [2961129711304] , whereby an arbitrary set of initial conditions 
are parametrized by a boundary action defined on a so-called ‘New Physics Hyper¬ 
surface’ (NPH) [305] that consists of the usual derivative expansion but now with 


ixiiiFor instance, liquid helium-4 in a supercooled phase exhibits such a dispersion relation, although 
its interpretation is in terms of a ‘second sound’ where one transitions from a phase where standard 
phonons propagate to one where hypothesized ‘rotons’ are the dominant propagating modes 1^ . 
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fields restricted to the boundary. These boundary operators can be used to construct 
arbitrary initial states, and allows one to systematically calculate the effects on 
CMB observables. It should be stressed that the NPH need not be a hyper-surface 
at some fixed initial time and could even correspond to one for which different 
comoving modes are ‘initialized’ at differing coordinate times. The net result in this 
context as well is that the effects of trans-Planckian physics generically results in 


modulations of (1161 where to leading order 


1 


(125) 


with phases that vary logarithmically in k due to the fact that the mode functions 
always depend on the combination k/(aH) and that H decreases during slow roll 


(see for details). Therefore, we find that the second term in ( |116[ ) contributes 
a logarithmically oscillating contribution to the power spectrum with an overall 
amplitude of \f3k\ ~ H/Mc (also decreasing logarithmically during slow roll) which 


can realistically result in percent level modulations (Fig. 101. These appear to be on 


the threshold of detectability (in a sense that will be made precise in the following 
sections), although no significant evidence for these have appeared so far in WMAP 
[30611307] and Planck data 


4. Observables: measuring spatial fluctuations 

In the previous section, we saw that there are a wide range of theoretical scenarios 
that can lead to features in the primordial power spectrum, which begs the question: 
are any of these realised in nature? Addressing this requires us to confront the 
theoretical predictions with experimental evidence. However, our quest for features 
is not completely straightforward - nature has endeavoured to muddle the waters 
in several nefarious ways. 

First of all, the quantities predicted by theory, i.e. the statistical properties of 
the primordial curvature perturbation are not directly observable; we can neither 
access anything primordial, nor can we “see” curvature perturbations. At best, 
we can gain indirect information by measuring derived quantities, such as matter 
density perturbations or anisotropies in the cosmic microwave background that have 
evolved from the initial state. In general, the relation of the primordial spectrum 
to actual observables depends on additional cosmological parameters unrelated to 
inflation (e.g. the baryon and dark matter densities, the Hubble parameter, the 
optical depth at recombination etc.) whose effect may, in some cases, be confused 
with a feature. 

Secondly, the generation of initial perturbations is a stochastic process, and 
quantities like the primordial curvature perturbation or the matter density fluctua- 


ixiv^g should also highlight several works that further consider features and enhancements to the 
three point function of the adiabatic mode from modified initial states l30844,Slll that in certain 
contexts, can also be arranged to preserve the scale invariance of the two point function I312II3131 . 
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tion are random fields. Exact, deterministic theoretical predictions of, for example, 
the power spectrum of these fields can only be made for ensemble averages (or 
equivalently, their average over an infinite volume). When it comes to observing the 
universe however, we are limited to finite volumes. In this case, the theoretical power 
spectrum of the field over this volume becomes a random variable: its expectation 
value is given by the ensemble average, but it is subject to a sampling variance (in 
this context also known as cosmic variance) which depends on the size and shape of 
the volume observed. Note that sampling variance is an intrinsic theoretical uncer¬ 
tainty; the only way to reduce it is by increasing the observed volume. This means 
that even if we had a perfect measurement, the power spectrum constructed from 
these data will be subject to a scatter which can obscure or even mimic the presence 
of features. 

Thirdly, the scatter is compounded by statistical uncertainties that all realistic 
measurements are subject to (e.g., due to finite detector resolution, detector noise, 
shot noise, etc.). This problem can at least in principle be overcome with better 
technological means; once these uncertainties are reduced to below the level of the 
sampling variance, they cease to be relevant. 

Finally, there is only a finite window of wavenumbers accessible to observation. 
The size of the universe’s particle horizon today sets a lower limit on the wavenumber 
of observable perturbations fc > 2 x 10“’^ Mpc“^. The upper limit depends on the 
probe: structures at low redshifts {z 1) have undergone non-linear evolution on 
scales k > Mpc“^, where mode-coupling has erased all but the broadest 

features. At higher redshifts non-linear evolution becomes less of an issue, though in 
the case of the CMB, the original signal is significantly suppressed by Silk-damping 
from photon diffusion [314] and tends to be swamped by foregrounds beyond k > 
0(10“^) Mpc“^. For other probes of the (matter) power spectrum such as 21 cm 
tomography, it may in principle be possible to see linearly coupled modes out to fc « 
0(10^) Mpc“^ Therefore, given presently available data and being generous, 

the observable window spans a good four orders of magnitude in fc, corresponding 
to roughly 10 e-foldings of inflation; features produced outside this range are hidden 
from our direct view. However, as previously mentioned and as we shall elaborate 
upon further below, it will soon be possible to open the window on the shorter 
wavelength modes by tapping the information contained in spectral distortions and 
21 cm surveys, the former potentially allowing us a window onto comoving scales 
up to fc « O(IO^) Mpc“^ I315j . This would add another 5 e-foldings of inflation 
to the picture. 

So what options do we have to get a handle on the primordial power spectrum? 
In this section, we will cover the most direct approach: measuring inhomogeneities 
(or anisotropies) of the universe’s constituents and relating them to primordial per¬ 
turbations. All energy components of the ACDM model except for dark energy (i.e., 
photons, neutrinos, baryons and cold dark matter), have inhomogeneities, making 
them potentially useful sources of information about the initial fluctuations. Out 
of these, we can exclude the perturbations of the neutrino background: while in 
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principle a good probe of the primordial power spectrum |316j , cosmic background 
neutrinos are elusive creatures and although their direct detection may be within 
reach with current technology |317H319] , a detection of their anisotropies is unlikely 
in the foreseeable future. This leaves us with photon and matter perturbations. The 
latter can be revealed either via visible tracers such as galaxies and neutral hydro¬ 
gen gas, or through their weak gravitational lensing effect on background sources. 
The perturbations of the photons (or, more precisely, of the baryon-photon fluid 
around decoupling) can be directly observed in the form of temperature and po¬ 
larisation anisotropies of the cosmic microwave background. Furthermore, looking 
beyond anisotropies, one may potentially be able to constrain or even detect pri¬ 
mordial features with just the monopole through distortions to the CMB spectrum. 
We elaborate upon this intriguing idea in detail in Section 


4.1. Cosmic microwave background anisotropies 

Currently, the by far most sensitive probe for features, and the only one able to 
reach the lower limit of the observable wavenumber window, are the anisotropy 
measurements of the cosmic microwave background radiation. From the observed 
maps of the CMB’s temperature fluctuations and pol arisation, one can extract the 
angular power spectra where X,Y G {T,E}I^ These are defined as follows- 

first consider expanding the observed temperature anisotropies in the sky in terms 
of spherical harmonics 


AT 

f 


(n) = '^aJ^Yemin) 

£m 


(126) 


where n is a unit vector on the sphere and where the aj^ are stochastic variables. 
Were we to presume statistical isotropy, it can be shown that the average over an 
ensemble of universes satisfies 


= Ce Su'S, 


(127) 


which defines the Cf^’s as the angular auto/cross correlation function for the 
variables X/Y in question. The observed are to be compared to the theoretical 
prediction, Cf'^. In the absence of initial vector and tensor perturbations, to linear 
order, the are determined by the scalar contributions to the temperature 

and E-polarisation auto- and cross-correlation angular power spectra, which can be 
expressed in terms of the power spectrum of the comoving curvature mode as |320] 


^ J dlnfc Af{k,To)Aj{k,To)Vn{k). 


(128) 


ixvThe 5-component of the CMB polarisation is not sourced by scalar perturbations at linear 
order and hence not helpful in probing 


















June 22, 2015 


53 


The transfer functions Af- are integrals over comoving time r, 
Af (fc,ro) = j dr {k,T)ji{k{T - tq)), 


(129) 


with a geometrical part given by the spherical Bessel functions of order £, and 
the source functions which contain the dependence on non-primordial cosmolog¬ 
ical parameters such as the matter and baryon densities. The source functions are 
typically evaluated using a Boltzmann code (see |321) for a detailed derivation and 
[47] for a reasonably accurate analytic understanding up to £ ^ 200). The thus 
computed are ensemble averages; for observations of the full sky, we can estimate 
the angular cross/auto power spectra by constructing the following estimator: 

1 




Y* 

ImYlm- 


(130) 


From (1271 we see that although this estimator is unbiased in the sense that 


{cr) = c, 

it has a non-zero (cosmic) variance 1^ . 




(131) 


var := {Cf^CD - {CDifiD 

1 


(132) 
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f/ X X \ / Y* Y* \ , / X Y* \ / X y=ic\l 
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cf^cr+c, 


XY^ 


where the third line follows from the second by presuming that the atm^s satisfy 
almost Gaussian statistics and that they correspond to real variables (so that = 
. Therefore for the auto-correlation functions we have 


af_ 


var 

2£ -I- 1 


(133) 


That is to say, the are random variables with mean and subject to a 


cosmic (sampling) variance given by (132). Realistic experiments will not be able 
to observe the CMB over the full sky, either due to experimental design (e.g., for 
ground-based or balloon experiments) or the need to mask areas where foregrounds 
cannot be removed, such as the galactic plane. If only a fraction /sky of the sky is 
covered, the cosmic variance will scale roughly with a factor l//sky, and the inferred 
are no longer uncorrelated. 


r^XY 


Looking at equation (128), schematically, the angular power spectrum is a con¬ 


volution of the primordial power spectrum with a window function, given by the 
prodnct of two transfer functions. Invariably, this convolution leads to a loss of in¬ 
formation: features in Vn^k) will be smeared out to a certain degree and show up 
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k/Mpc"^ 

Fig. 11. Temperature auto-correlation angular power spectrum window functions for multipoles 
£ = 2 , 20, 200 and 2000, normalised to their respective maxima. 


less prominently in C( If the characteristic width of a feature in fc-space exceeds 
the width of the window function on the corresponding scales, its amplitude in 
the angular power spectrum will be approximately suppressed by the inverse ratio 
between the two. Thus, provided a sufficiently high frequency, even a relative mod¬ 
ulation of the primordial spectrum of 0(1) could be hidden in the angular power 
spectrum- though the deviation from linear evolution required to produce such a 
large modulation is likely going to induce a strong signal in higher order correlations 
(see Section 4.3). 


4.1.1. CMB temperature anisotropies 

We show the window functions of the temperature anisotropies for multipoles 
£ = 2, 20, 200 and 2000 in Figure El Their rapid oscillations are due to the Bessel 
functions, and changing the later-time cosmology will modify the envelopes. As 
one goes to higher multipoles, the log-width of the window functions decreases 
from about 0.4 at £ = 2 to 0.045 at £ = 2000 (assuming the Planck best-fit ACDM 
late-time cosmology). At multipoles £ > 2000, the primary CMB temperature signal 
becomes rapidly subdominant to secondary microwave anisotropies, e.g., from point 


Besides the smearing due to the geometrical projection, the observed angular power spectra are 
themselves subject to further convolutions due to (i) the effects of weak gravitational lensing on 
the CMB [323], and (ii) the aberration caused by the proper motion of the observer with respect 
to the CMB rest frame [32l]. 
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Fig. 12. Top; Planck full mission measurement of the angular power spectrum of the temperature 
autocorrelation of the CMB [^. Bottom: Residuals in units of standard deviations with respect 
to the best-fit ACDM model. For £ > 30, the power spectrum is binned. The error bars denote 
the variance due to sampling error and experimental uncertainties due to, e.g., detector noise, 
component separation or beam errors. 


sources, the cosmic infrared background or the Sunyaev-Zeldovich effects |32511.S26j . 
which are which are much less sensitive to the presence of features. Thus the range 
of wavenumbers over which the CMB temperature spectrum is sensitive to features 
is given roughly by the interval 0.0002 Mpc~^ ^ k < 0.15 Mpc~^. This range has 
in fact been almost entirely covered at the cosmic variance limit by the Planck 
mission (see Figure 121. 


4.1.2. CMB polarisation anisotropies 

The temperature anisotropies of the cosmic microwave background are not the only 
source of information one can obtain from the decoupling era. The CMB is weakly 
linearly polarised due to Thomson scattering (the low energy limit of Compton 
scattering) of the CMB photons during decoupling |32711328] , sourced by the lo¬ 
cal quadrupole anisotropy in the incident radiation intensity field. Measuring the 
angular power spectrum of the FI-component of the CMB polarisation is a power¬ 
ful complementary tool in the search for features in the primordial spectrum and 
potentially even surpasses the temperature in terms of sensitivity. Not only is the 
small-scale polarisation signal to a lesser degree subject to foreground contamina¬ 
tion, with the primary signal dominant up to £ ~ 4000 |32911330] , the polarisation 
window functions for a given multipole are also narrower than the corresponding 
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CM 



temperature window functions (see Figure 131 |331] . leading to more sharply de¬ 
fined features in the observable ill-autocorrelation spectrum. Here, the log-width 
of the window functions ranges from roughly 0.13 at i = 2 to 0.03 at £ = 2000. 
It should be noted however, that on the largest scales (£ < 20), the polarisation 
power spectrum is dominated by the contribution generated during reionisation. At 
those scales, features in the iHA-spectrum could also imply a deviation from the 
commonly assumed nearly instantaneous reionisation model, rather than point to a 
primordial origin. 

Mortonson et al. US], for example, considered a putative oscillatory feature 
around k = 2 — 5 x 10“^ Mpc“^ (corresponding to 20 < £ < 40) which improves 
the fit to CMB temperature data by 2Aln£^^ « 10 (see Sect. for a defini¬ 
tion of the former quantity). In a cosmic-variance limited measurement of the ill- 
polarisation autocorrelation spectrum, such a feature can be expected to lead to 
2Aln£^^ « 60 when the reionisation history is known, and ~ 30 if it is allowed 
to freely vary. Since the iHiH-spectrum is suppressed by a few orders of magnitude 
with respect to the temperature spectrum, measuring it demands much higher in¬ 
strumental sensitivity. Unlike Planck’s temperature data, its polarisation data is 
not limited by cosmic variance (but nonetheless adds useful information at inter¬ 
mediate scales [1811135| b An all-sky cosmic variance-limited measurement to much 
smaller scales is feasible with current technology and several experimental designs 
have been proposed [33011332L1333] . Although none of the former have been selected 
for funding to date, ground based and sub-orbital CMB polarisation measurements 
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(which either by themselves |334j or taken in combination |335H348] , promise almost 
full sky coverage) remain one of the most promising future avenues for the detection 
of features, especially on the largest scales. 


4.2. The matter power spectrum 

Long after recombination, the initial inhomogeneities imprinted in the CMB evolved 
into the scaffolding upon which large scale structures formed. The central quantity 
to describe the statistical properties of these inhomogeneities (which are matter 
density fluctuations around the mean density) given by Spmjpm = <Jm, is the matter 
power spectrum Vni{k, z), defined via 

27r2 

((5m(k,z)dm(k',^)) = -p-(5^(k-k') V^(k,z). (134) 

As long as the fluctuation amplitude is small, i5„i ^ 1, matter fluctuations of differ¬ 
ent wavenumbers evolve independently and Vm is directly related to the primordial 
power spectrum of curvature fluctuations via 

r^{k,z)= J dk' T^{k,k',z)Vn{k)^T^^{k,z)Vn{k), (135) 


where is the (linear) matter transfer function containing the dependence on 

the intervening cosmology. Compared to the CMB where information about the 
fluctuations is fundamentally limited to the two-dimensional surface of last scat¬ 
tering, the distribution of matter can be probed in three dimensions. This has two 
important consequences: firstly, one gains sensitivity to the time evolution of the 
power spectrum which is very helpful in disentangling primordial features from 
non-primordial ones*’'''" Secondly, the ability to detect density fluctuations along 
the line of sight results in an increase of the number of observable modes for a 
given wavenumber, greatly reducing the uncertainty due to sample variance, with 
vaiVra{k) oc V~^k~‘^Vm{k) for a given volume V. 

Despite the ability to recover some 3-dimensional information, a finite-volume 
survey will still be subject to a limited resolution in k. This becomes apparent when 
one considers the spatial correlation function, which is related to the power spectrum 
by a Fourier transform. Take for instance a sinusoidal modulation of V-jiik) with 
frequency w: in the correlation function this corresponds to a spike at a length scale 
I oc w. If for a given frequency w the survey volume is too small to accommodate 
this length scale, it will not be possible to see the spatial correlation and thus one 
also will not be able to resolve the modulation. As a consequence, observations of 
the matter perturbations over a finite volume only allow us to construct an estimate 


l^viiConsider, e.g., the suppression of power on small scales due to the free-streaming effect of 
massive neutrinos. This is a time-dependent effect that becomes more pronounced at later times, 
as opposed to a primordial power spectrum suppression, which will show up equally at all redshifts. 
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Fig. 14. Relative fluctuations for two example primordial features (top) and the correspond¬ 
ing modulation of the CMB temperature and FJ-polarisation angular power spectra (centre, thin 
black lines mark the cosmic variance). Bottom: signature of the features in the matter power 
spectrum convolved with the window function of Eq. |137| assuming effective survey volumes of 
= 1 Gpc^h~^ (dotted lines) and = 50 Gpc^h“^ (solid lines). 


of the power spectrum in band powers Vi [349] . which are related to the full power 
spectrum in terms of a convolution with a set of window functions yVi{k)\ 


V,{ki) = j dk'W,{k-k')V{k'). 


(136) 


Depending on the survey geometry, one can construct an optimal set of window 
functions that are as narrow and uncorrelated as possible without compromising 
the variance in V |350j . For a volume-limited all-sky survey out to a distance i?, the 
optimal window function in the large-scale limit is given by 

whose width scales like R~^. This effect is illustrated in the bottom panel of Fig. 
for effective survey volumes = 1 Gpc^h~^, representative of current surveys 
(e.g., SDSS LRG |351j ~l. and = 50 Gpc^/i“^, which is within reach of upcoming 
projects like the Large Synoptic Survey Telescope (LSST) |352j or Euclid [3T]. In 
the example here, the feature at large scales cannot be resolved. The width of W for 
a i? ~ C>{1) Gpc is 0(0.001) IiMpc”^, so structures in the spectrum with a smaller 
characteristic size Afc will be erased. 
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When it conies to mapping the distribution of matter in the universe, we can, 
depending on the circumstances, draw upon three different methods for the matter 
to reveal its presence: (i) gravitational leasing, (ii) absorption or (iii) emission of 
photons. The first category can be exploited by measuring the weak gravitational 
lensing of galaxies |J5Jj or the CMB |J2Jj . While weak leasing has the great ad¬ 
vantage of probing all forms of matter, it is, unfortunately not very sensitive at all 
to features in the spectrum. The reason is that lensing is an integrated effect and 
depends on the matter distribution at all redshifts between source and observer; 
as a result, the lensing angular power spectrum for a given multipole has contribu¬ 
tions from Vm over a large range of wavenumbers, which leads to a very efficient 
smoothing of any but the most extreme primordial features. Just like weak lensing, 
exploiting the second category needs a background light source in whose spectrum 
absorption due to absorbing matter can be detected. In a cosmological context, the 
absorber would typically be neutral hydrogen due to its ubiquity in the early uni¬ 
verse. The distribution of matter along the line of sight can then be reconstructed 
by relating the redshift of particular absorption lines to the radial distance of the 
absorber and inferring its density at that distance from the absorption strength. The 
most straightforward probes of the matter power spectrum are based on emission, 
and in the following we will focus on the prospects of using galaxies and neutral 
hydrogen as tracers of V^. 

As long as the density fluctuations are small, i.e., 5p ^/<C 1 the matter power 
spectrum is well approximated with linear perturbation theory. At late times and 
small scales, however, the fluctuations are no longer small, linear theory breaks 
dowr |*’^^‘^ and a precise determination of the non-linear spectrum becomes 

a rather difficult problem. From the perspective of looking for features, the break¬ 
down of linear theory has another unwelcome side-effect: in the non-linear regime 
fluctuations of a given wavenumber k do not evolve independently of neighbour¬ 
ing wavenumbers anymore (mode-coupling), which is going to further suppress fine 
structures in the spectrum. 

4.2.1. Galaxy surveys 

The use of galaxies as a tracer of the matter distribution has a long history in 
cosmology. The measurement of their positions in the sky, along with their red- 
shift, from which one can infer their radial distance, allows one to construct their 
three-dimensional power spectrum Vg{k). However, since the formation of galaxies 
depends on the density of matter in a non-trivial way, the galaxy power spectrum is 
not actually the same as the matter power spectrum which we can easily calculate 
from theory. Rather, it is a biased tracer of Vm{k). 

On linear scales, the two are related by the simple relation Vg{k) = lP'{z)'Pm{k) 
with a redshift-dependent proportionality factor, the bias > 1, which in general 


ixviiig^^- redshift 2 : ~ 0 this starts becoming relevant for wavenumbers k > 0.05/i Mpc 
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also depends on the type of galaxies considered. On non-linear scales, the bias 
also picks up a scale-dependence Vg^{k){k) = b‘^{k, z)V^{k) which is even more 
challenging to predict than the non-linear corrections to the matter power spectrum. 
The wavenumbers at which non-linear corrections and the scale-dependent bias 
for a given type of object start becoming important do increase as one goes to 
higher redshift, but at the same time accessing higher redshifts requires tracing 
intrinsically brighter objects which tend to be more strongly biased. At the moment 
these effects are not even well enough understood for the standard cosmological 
model, let alone features models. Conservatively speaking, it will be very difficult 
to obtain reliable information from measurements of galaxy clustering on scales 
beyond k « 0.1 /iMpc”^. 

At present, galaxy power spectrum data are not as constraining as CMB tem¬ 
perature data which cover the same range, since only a small region of our cosmic 
neighbourhood is known to us, and the sample variance cannot compete with the 
CMB’s cosmic variance. Luckily, this is about to change though: within a decade 
from now, ongoing surveys such as the Dark Energy Survey (DES) [354] and next- 
generation large-volume surveys like DESI [355] . the LSST [352] or Euclid [H] are 
going to map out a good fraction of our surroundings up to a redshift z « 2, allow¬ 
ing us to take advantage of the V~^ scaling of the sample variance and eventually 
beat the current gold standard set by the CMB. 

It is not only the lower sample variance, but also the possibility of obtaining 
3-dimensional information that distinguishes probes of the matter power spectrum 
from the CMB. As can be seen from the bottom panel of Figure for a typical 
feature in this intermediate wavenumber range, the finite window functions of a 
survey with effective volume of 0(1 — 10) Gpc^ h~^ do not lead to a signihcant 
degradation of the signal, unlike the CMB signal, which is markedly attenuated in 
this region. Features with characteristic sizes smaller than Ak « 0.001 hMpc”^ on 
the other hand, cannot be resolved, which essentially makes a detection of any but 
the broadest features on scales k < 0.01 Mpc“^ unrealistic. Nonetheless, within an 
intermediate range of wavenumbers 0(0.01) /iMpc”^ ^ k < 0.1 hMpc”^, galaxy 
surveys are a very powerful resource and hold a lot of potential for the discovery of 
features. 


4.2.2. The Lyman-a forest 

Additional insights on features in the power spectrum may be gained at redshifts 
beyond z « 2 from the Lyman-a-forest, i.e., absorption signatures of neutral hydro¬ 
gen in the spectra of objects such as quasars or Lyman break galaxies caused by the 
Lyman-a transition [356] . from which the flux power spectrum can be extracted. 
While individual absorption spectra only yield an estimate of a one-dimensional 
power spectrum which would not be very helpful for the purpose of resolving 
features, combining the information of a sufficient number of them allows the 3- 
dimensional power spectrum (or its Fourier-equivalent, the correlation function) to 
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be estimated |357H359j . 

The Lyman-a-forest covers a redshift range of 2 < z < 4, and at these redshifts, 
non-linear evolution of the underlying matter power spectrum does not play as big 
a role as it does for the galaxy power spectrum, giving in principle access to fluc¬ 
tuations on smaller scales 0(0.01) hMpc”^ ^ k < 1 /iMpc”^. However, the gas 
physics relating the flux power spectrum to the matter power spectrum is compli¬ 
cated and requires computationally expensive hydrodynamical simulations; to what 
extent this would affect features is an open question, but it does not seem unreason¬ 
able to assume that one can expect at least a qualitative sensitivity to deviations 
from smoothness from these data. 


4.2.3. 21 cm surveys 

An even more radical idea to use neutral hydrogen as a tracer of the matter dis¬ 
tribution and reconstruct its three-dimensional structure is by measuring emission 
or absorption due to the 21 cm hydrogen spin flip transition |360) . This could be 
done either via a measurement of the emission caused by reionisation [361] in a 
redshift range of 6 < z < 15, or via the absorption of CMB photons during the dark 
ages 15 < z < 200. Whereas the redshift range of galaxy surveys (z < 2) covers 
only about 5% of the Hubble volume, 21 cm emission could in conceivably make 
an additional ~ 20% of the Hubble volume accessible, plus another ~ 40% from 
absorption [362] (assuming full sky coverage) and hence in principle allowing survey 
volumes up to 0(1000 Gpc^h“^). With correspondingly narrower window functions, 
21 cm surveys will be able to extend sensitivity to features down to wavenumbers 
k ~ O.OOl/i Mpc“^, reaching almost the range of the CMB. Furthermore, the matter 
fluctuations at the redshifts targeted by 21 cm surveys are even smaller, and thus 
even less subject to non-linear effects, opening the window to features on scales as 
small as k = O{10'^)h Mpc“^, thus allowing us to extend our search for features to 
scales inaccessible to other probes of spatial fluctuations. 

While obtaining all the potential information from such a 21 cm survey would 
certainly be the ultimate dream of any cosmologist, it is at this point not clear 
whether this goal can ever be achieved. The cosmological signal is extremely feeble, 
and disentangling it from astrophysical foregrounds is going to be a very challenging 
task [363] for the reionisation era, and even more so for the dark ages. To date, no 
cosmological signal has been detected although experimental programmes are on 
the way to evaluate the potential of this observable. Upcoming radio-surveys such 
as the Square Kilometre Array m or the Tianlai project [365] do have cosmology- 
related science goals, but a competitive sensitivity to features still lies in the more 
distant future, and will require an experiment such as the Fast Fourier Transform 
Telescope [HHH] . 
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4.3. Higher order correlators 

As discussed in sections and any features in the two point correlation func¬ 
tions will correlate with features in higher order correlation functions for adiabatic 
fluctuations. In the CMB, one can thus entertain looking for such features in the 
bispectrum. Although detecting small amounts of non-Gaussianity with reasonable 
statistical significance remains the observational challenge of the day, the possibility 
that cross correlations (possibly even among different tracers) might exist with the 
power spectra enhances the prospects for their detectior[i^^ 

How does one then go about searching for primordial non-Gaussianities? Recall 


first that (126), the expression for the angular power spectrum (128) follows from 

d^k 


T 


= 47r(— 


(27r)' 


l^i{k,To)n{k)Yi,n{k) 


(138) 


where the A^’s are given by (129) and where k is the unit norm vector on the 


(momentum) 2-sphere. Analogous to the definition of the angular power spectrum 
dl^ , we see that given the definition of the bispectrum ([6^ 


%^(t)^gjt)%^(t)) = (2^)3 j(3) BnikMM. 


we can define the temperature angular bispectrum in analogy to (127) as 

_ /„T T T \ 

^771117121713 


(139) 


From (138) and (64) it can be shown that the angular bispectrum relates to the 


momentum space bispectrum as [36711368j 

3 3 

^771177127713 


y^dy k'fdki ^^(fci, *2, k3)Ai^ {ki,To)Ai^{k2, To)At^ (fcg, tq) 
X Ml {kiy)je2 (* 2 y) j^3 (ksy) (140) 


where 
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results from the angular integration of the product of three spherical harmonicqj^ 
and where the quantities in the parentheses are the Wigner 3-j symbols. Since 
isotropy remains a good approximation to the observed sky, it is also common to 
work with the averaged bispectrum defined as 

£2 4 

\mi m 2 m 3 . 
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ixixAlthough we shall focus the discussion that follows on the bispectrum, as reviewed in ED , one 
can look to quantify primordial non-Gaussianities in a number of other ways including making use 
of Minkowski functionals and wavelets to name but a few. 

ixxThe dummy integral over y results from switching to the integral representation of fcj). 
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Given the inherent difficulties of extracting 
three point functions of putatively very weakly 
coupled perturbations from observations'™* it 
is much more practical to ask the question of 
how significantly (in the statistical sense) a par¬ 
ticular shape of the bispectrum is realized in the 
actual data. To this end, we need a useful way 
of characterizing a potentially arbitrary signal. 




Although the /nl class of diagnostics (66) are 



Fig. 15. The domain V of tS(fci,/c 2 ,/ca) 


(143) 


useful for the sort of non-Gaussianities expected 
from specific classes of almost scale invariant 
models of single field inflation, they are inad¬ 
equate for searches for features, for more gen¬ 
eral models of inflation |369H372j , or for the type 
of non-Gaussian signals expected from topologi¬ 
cal defects. Shellard, Fergusson and Liguori have 
proposed just such a general scheme [55711^ 

which we review briefly in what follows. We first define the shape function 

S{ki,k2,k3) = ^ {kik2k3f Bn(ki,k2,k3) 

where is some suitable normalization that depends on the shape in question. 
Given that the observable number of modes is finite (the upper limit coming from the 
scale at which foregrounds start to dominate the GMB signal), the domain for the 
shape function is given by the tetrapyd defined by ki,k 2 ,k 3 < fcmax in conjunction 
with the triangle inequalities ki < k 2 + k 3 (-1- permutations). We denote this domain 
V, illustrated in Fig. We now ask, given an actual realization in the GMB of 
the bispectrum shape A, how well would an estimator based on some other shape 
B pick up this signal? For this, we introduce the notion of a ‘cosine’ between two 
shapes |373) . defined as 

{Sa,Sb) 


{Sa^SaY/HSb.SbY/^ 

where the scalar product between two shapes {A, B) is defined as 


(144) 


{A,B):= [ cIVloS^Sb, (145) 

Jv 

where one is obliged to introduce a non-trivial measure a;(fci, A: 2 , ^a) to weight for 
non-trivial scaling of the bispectrum. For the purposes of the present discussion we 
don’t need to consider it further, referring the interested reader to |368LI370j for de¬ 
tails. The size of this overlap indicates how much an estimator optimized for shape 


ixxiThe bispectrum signal being too weak to be able to extract individual multipoles- the only 
practical option being to use an estimator that sums over all mulitipoles (i.e. sky averages). 
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B would detect the true signal from shape A. Clearly, if we had a basis of estima¬ 
tors that could capture an arbitrary shape, we would in principle be sensitive to an 


arbitrary signal. With the scalar product (145), one can indeed construct such an 


(orthonormal) basis of functions via the Gram-Schmidt procedure on V. The precise 
basis one obtains depends on the root basis vector one starts the orthogonalization 
procedure with. A separable polynomial basis suitable for various scale invariant 
bispectra was elaborated upon in [1167111168111174111175] . For sensitivity to oscillatory 
signatures in the bispectrum (as is expected in resonance models), Meerburg pro¬ 
posed a separable Fourier mode basis (.'176|‘’^’‘“ More recently, Byun et al. proposed 


a basis of localized piecewise spline functions tailored towards non-separable bis¬ 
pectra [377]. Given a particular choice, one can then make a modal expansion of a 
given shape function as 


S{ki,k2,k3) = y^a^Rn{ki,k2,k3) 


(146) 


expanded in terms of the orthonormal basis vectors 

{Rn^Rm') — dnm 

where the coefficients are obtained as 

= {S,Rn) 


(147) 


(148) 


In this way, one can in principle characterize an arbitrary bispectrum shape (al¬ 


though the rapidity of convergence of the expansion (146) will depend on the com¬ 


patibility of the basis chosen with the realized shape). To date, searches for features 
in the bispectrum alone have been inconclusive although recent efforts to cross cor¬ 
relate these with searches for features in the power spectrum have returned results 
of statistically marginal significance [1281137814380] . In the near future this enter¬ 
prise will receive an added fillip from the results of a variety of LSS surveys (as 
discussed earlier) [3011381] . However, extracting information about primordial non- 
Gaussianity from LSS observables such as the galaxy bispectrum is still subject 
to host of theoretical uncertainties (concerning scale dependent biasing at shorter 
comoving scales in addition to requiring a better understanding the relationship 
between the observed matter power spectrum and bispectrum and their primordial 
counterparts at mildly non-linear scales) [382H384] , although this promises to be an 
active field of research in the coming years. 


5. Observables: measuring CMB spectral distortions 

So far, we only discussed GMB observables related to the temperature and polari¬ 
sation anisotropies. There is, however, another valuable piece of information which 


'’‘’‘“Expressions for the first few basis vectors in the polynomial and Fourier mode bases are detailed 
in the respective l368l 13761 . 
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may become available in the future: the CMB energy spectrum. Since the mea¬ 
surements of COBFi/FIRAS |385l I386j . the spectrum of the CMB is known to be 
extremely close to a perfect blackbody at a temperature of Tg = (2.726 ± 0.001) K 
today [ 557 ] . Even if no deviation from a blackbody spectrum — commonly referred 
to as spectral distortion — was detected, this remarkable observation places very 
tight constraints on the thermal history of our universe, ruling out cosmologies with 
extended periods of significant energy release that could have disturbed the equi¬ 
librium between matter and radiation in the post-BBN era. One source of energy 
release is due to the dissipation of small-scale perturbations in the photon-baryon 
fluid |388H391[ I315L1392] , as we explain here. 

Nearly 25 years have passed since the launch of COBE, and from the tech¬ 
nological point of view already today it should be possible to improve the ab¬ 
solute spectral sensitivity by more than three orders of magnitude [251129|. This 
will open a new window to the early universe, on one hand allowing us to di¬ 
rectly probe processes that are present within the standard cosmological paradigm 
(e.g., distortions from the reionisation and recombination epochs), but on the 
other hand also opening up a huge discovery space to unexplored non-standard 
physics (e.g., decaying/annihilating relic particles, evaporation black holes or cos¬ 
mic strings) [for overview see I393H396L 155] . At this stage, CMB spectral distor¬ 
tion measurements are furthermore only possible from space, so that in contrast 
to B-mode polarisation science competition from the ground is largely excluded, 
making CMB spectral distortions a unique target for future CMB space missions 
[ 557 ] . As we highlight in this section, CMB spectral distortions can help us to con¬ 
strain the shape and amplitude of the small-scale power spectrum at wavenumbers 
few X Mpc“^ ^ ^ 2 X 10^ Mpc“^, scales that are inaccessible by other means and 

complement CMB anisotropy measurements at larger scales. 


5.1. Thermalisation physics primer 

The physics going into the cosmological thermalisation calculation — the process 
that restores the pure blackbody spectrum after some departure from thermal equi¬ 
librium — are pretty simple and well understood, allowing us to make precise pre¬ 
dictions for different thermal histories and energy release scenarios. For primordial 
spectral distortions, we are mainly concerned with the average CMB spectrum, so 
that spatial perturbations can be neglected and the universe can be described as 
uniformly expanding, thermal plasma consisting of free electrons, hydrogen and he¬ 
lium atoms and their corresponding ions inside a uniform bath of CMB photons. 
We shall also restrict ourselves to redshifts z < few x 10^, when electron-positron 
pairs already completely disappeared, since earlier thermalisation is perfect from 
any practical point of view and no observable distortion remains. 

Under these circumstances, any energy release inevitably causes a momentary 
distortion of the CMB spectrum. This can be understood with the following simple 
arguments: a pure blackbody spectrum, B^(T), is fully characterised by one number. 
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its temperature T. Changing the energy density, cx of the photon field by 
<C 1 (e.g., from some particle decay) means that the photon number also 
has to be readjusted by ANy/N.y « (3/4)Ap^/p^ to restore the blackbody relations 
p-y oc and N-y oc T^. The photons, furthermore, have to be distributed according 
to ~ dB^/dT in energy, to correctly shift the initial blackbody temperature from T 
to T' Ki T + (l/4)Ap^/p^ without creating a distortion. Thus, the correct relation 
between photon number and energy density as well as the correct change of the 
distribution function are required to avoid a distortion. If one of these conditions is 
violated (e.g., if only energy is transferred) then a distortion is inevitable. 

In the early universe, the double Compton (DC) and Bremsstrahlung (BR) pro¬ 
cesses are controlling the number of CMB photons, while Compton scattering (CS) 
allows photons to diffuse in energy. The interplay of these interactions between mat¬ 
ter and radiation determines the precise shape of the CMB spectrum at any stage 
of its evolution. When studying different energy release mechanisms, one question 
thus is whether there was enough time between the energy release event and our 
observation to produce and redistribute those distortion photons thereby fully 


recovering from the perturbation and completing the thermalisation process. 


5.1.1. Distortion visibility function 

The thermalisation problem has been studied thoroughly both analytically |325l 
[8MM4fmi3Mll¥i8TOl5] and numerically 144)61141)91 [39^ . From these studies, 

the following simplified picture can be drawn: at z > 2 x 10®, when the universe 
is less than a few month old, the thermalisation process is extremely efficient and 
practically any distortion can be erased until today. At lower redshifts, the CMB 
spectrum becomes vulnerable to disturbances in the thermal history and only small 
amounts of energy can be ingested without violating the tight experimental bounds 
from COBF,/FIRAS |385L 13861141 Ij and other distortion measurements |412H414] . 

The transition from efficient to inefficient thermalisation is encoded by the dis¬ 
tortion visibility function, which determines by how much the distortion 

amplitude (regardless of its shape) is suppressed between two redshifts z and z' < z. 
Due to the huge entropy of the universe (there are ~ 1.6 x 10® times more pho¬ 
tons than baryons), the DC process |415H418] is the most important source of soft 
photons at high redshifts, so that the distortion visibility function is roughly given 
by J{z,z') ~ ^ with thermalisation redshift z^c ~ 1-98 x 10® 

which is determined by the efficiency of DC photon production and Compton redis¬ 
tribution |401L 140611407| . Improved approximations for the visibility function exist 
[40311405) . but for simple estimates the above expression suffices. For our purpose, 
we also set z' = 0 and then use J{z) = J'(z,0) = . The distortion vis- 


ix^iiiEnergy release is the most common mechanism to produce distortions. However, the adiabatic 
cooling of matter in fact extracts energy from the CMB [3981139.^ . so that an excess of photons is 
found in the CMB spectrum. In this case, DC and BR absorb photons. 
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ibility function thus cuts off exponentially for z > Zdc (see Fig. 171. How far into 
the cosmic photosphere |419| — the epoch of the universe during which 17 1 — 

one could view, thus depends on the absolute sensitivity of the experiment and how 
much initial energy had to be thermalised. 

With a PIXIE-type experiment [2S], one might be able to detect a distortion 
created as early as z ~ 6 x 10® if ~ 0.01 of energy were liberated by some 

process. At much later times (z < few x 10®), the distortion visibility is very close 
to unity (e-)- basically all injected energy will still be visible as a distortion today; 
see Fig. and the upper limits from COBE/F/i?AS' imply < 6 x 10 ® 

[386]. With a PIXIE-type experiment, this could be improved to t^p^jp^ < 8x10“®, 
allowing us to constrain tiny amounts of energy release related to standard processes 
occurring in our universe. 


5.1.2. Types of primordial distortions 

While the distortion visibility tells us how much of the released energy will still be 
visible as a spectral distortion today, it does not fix the shape of the distortion. Here, 
three regimes are most important: at z 2 x 10®, thermalisation is extremely effi¬ 
cient (distortion visibility J7 ^ 1) and CS, DC and BR are able to adjust the initial 
blackbody spectrum bJ{T) to B^{T + AT) « B^{T) + dTB^(T)AT + 0{AT^/T^) 
with AT/T « (l/4)Ap-y/p.y assuming that a total energy of Ap^/p^ ^ 1 was 
released. The spectral shape of a temperature shift [at lowest order in AT/T], 
Gv = {l/4)TdTB^(T) = j{2hi'^fc'^)xe^I— 1)® with x = hv/kT (see Eig. 16l, is 


not a distortion but just describes the change of the initial spectrum required to re¬ 
store full equilibrium. For this both scattering and number changing processes have 
to be very rapid. Since there is no prediction for the CMB monopole temperature, 
this part of the CMB spectrum can only be used to constrain changes in the CMB 
temperature; however, for this another method to determine the CMB temperature 
higher redshifts (e.g., recombination or BBN) is needed |42()H422] . 

In the next regime, valid until z ~ 3 x 10®, the efficiency of DC and BR gradually 
reduces while photons are still efficiently redistributed in energy by the Compton 
process. In this case, electrons and photons are in kinetic equilibrium with respect 
to CS, forming a chemical potential or /r-distortion [399] . but thermalisation stops 
being complete (o the distortion visibility function J7 approaches unity). Thus, the 
change of the initial blackbody is given by the superposition of a temperature shift 
and a pure ^-distortion, « 1.401 r9TBy(T)[0.4561 — kT/hv], 

At z < 10'^, up-scattering of photons by electrons also becomes inefficient and 
photons diffuse only little in energy. In this era, a Compton-y distortion is formed, 
also known in connection with the Sunyaev-Zeldovich effect of galaxy clusters [325] . 
The shape of a pure j/-distortion is given by « (1/4) TdTBi,{T)[x coth(a;/2) — 4], 
showing a characteristic decrease of the effective temperature at low frequencies 
and an increment in the CMB Wien-tail. The classical p- and y-distortion, have a 


slightly different shape (see Fig. 16). For a /r-distortion, the deviation from the CMB 
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Intensity signal for different heating redshifts 



Fig. 16. Change in the CMB spectrum after a single energy release at different heating redshifts, 
Zh- For ^ few X 10^, a temperature shift, Gi/, is created. Around 2 :^ — 3 x 10^ a pure p- 
distortion, M^y, appears, while at $ 10“^ a pure ^/-distortion, Yl, is formed. At all intermediate 
stages, the signal is given by a superposition of these extreme cases with a small (non-^/non-y or 
r-type) residual distortion that contains valuable information about the time-dependence of the 
energy-release process (Figure adapted from [Ho]). 


blackbody vanishes at ^ ~ 124 GHz, while the cross-over frequency for a y-distortion 
is ~ 217 GHz. With future experiments, one can thus hope to distinguish these 
two types of distortions, and since a /i-distortion can only be formed in the very 
early stage of the universe, its amplitude directly constrains episodes of early energy 
release at x > 5 x 10^ or until about 100 years after the Big Bang. 

The shape of any primordial distortion caused by energy release is close to a su¬ 
perposition of the extreme cases described above. But the reader probably noticed a 
gap in our description between 10^ ^ ^ 3 x 10®. In this regime, scattering becomes 

inefficient in redistributing photons over frequency and the distortion morphs be¬ 
tween a fj,- and y-distortion, but the transition is non-linear in the energy exchange 
and a smaller residual (non-y/non-y or r-type) distortion is formed in addition. The 
sum of y-, n- and r-distortion is sometimes called intermediate or hybrid distortion, 
but additional information is only gained from the residual distortion. Although in 
earlier numerical studies this regime was also mentioned |40611423j , only in the past 
years it was stressed that the residual distortion contains valuable time-dependent 
information [Ml SOI mni, which allows us to distinguish different energy release 
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scenarios This adds another dimension to the CMB, delivering more than 

just two numbers related to the /x- and y-distortion amplitudes. This is especially 
important since at late times (z < 10 — 20), the formation of structures reheats the 
medium to temperatures T ~ 10^ K—10^ K. In this era, a large uniform y-distortion 
is formed with y ~ 10“^ —10“® |425H43l] . which will swamp any primordial y-signal. 
Without the r-distortion we were left only with the amplitude of the /i-distortion 
and thus could just constrain the overall energy release! 


5.2. Computing and characterising the distortion 


It is straightforward to calculate the distortion for any energy release history di¬ 
rectly integrating the corresponding Boltzmann equations [e.g., iTnniHMiiTn7ii¥)^ . 
One flexible numerical approach is CosmoTherm [393], which for the first time al¬ 
lowed direct integration of the thermalisation problem explicitly including the time- 
dependence for a wide range of energy-release scenario. However, for case-by-case 
studies and parameter estimation full numerical approaches are too time-consuming. 
Fortunately, the problem can be simplified: generally we expect the distortion to 
be very small, so that the Boltzmann equations can be linearised. In this case, a 
Green’s function approach can be used |410] . This allows us to precisely calculate 
the distortion for a wide range of energy release scenarios. 

Given an energy release history, d{Q/p^)/dz, which describes the fractional 
energy release relative to the photon energy density as a function of redshift, with 
the thermalisation Green’s function, we can thus write |410j 


AC 


Gth(t 


d(Q/ Pj) j / 


(149) 


Here, Gthii^,z), encodes the relevant thermalisation physics which is independent 
of the energy release scenario (see Fig. 16), and AI^, is the final change of the initial 
blackbody after the energy release. With Eq. (149), one computation of AIi, only 
takes a tiny fraction of a second. This allows us to perform parameter estimations 
for different energy release scenarios, as first shown in [395] . The Green’s function 
approach is part of CosmoTherm and available at www.Chluba.de/CosmoTherm 
Alternative schemes were discussed in [404] . highlighting analytic approximations. 


5.2.1. Simple analytic estimates 

Ignoring the (small) correction from the r-distortion in the ^/y-transition regime 
(10"^ ^ ^ 3 X 10^)) we can estimate the shape of the intermediate distortion, 

describing it just as a superposition of p, and y terms. In simple words, knowing 
the fractional energy that is released during the j/-era and /x-era we obtain an 
approximation for the distortion shape. For the classical approximation, one simply 
assumes that the transition between /x and y is abrupt around z ~ z^y ~ 5 x 10^ 
[407] . Since at z < z^y, the distortion visibility function is extremely close to unity 
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3 ; - distortion [r-y transition jr - distortion 



Fig. 17. Simple approximations for the distortion visibility function and energy branching ratios 
in the different distortion eras. Here, ~ 1 — denotes the fraction of energy that is fully 

thermalised and we used ~ 1 — enforcing zero leakage of energy into the r-distortion. 


(0 all the released energy ends up as a distortion) and Gt\i{v,z) « 1)^, we have 


A/, 



Az' 


dz' = Yi, 


Ap^ 

Pi 


(150) 


Using instead the classical definition for the y-distortion, Fsz = 4Yi. [325], then 
implies lS.Ii, ~ 2/lsz with Compton-y parameter y « (l/4)Ap^/p-y|j,. 

At z > z^y, the distortion visibility drops, such that the effective energy release 
in the p-era is Ap^/p^|y « J'(z')[d((5/p^)/dz'] dz' with J{z) ~ _ 

The /r-distortion contribution can thus be estimated as 


A/, 




J{z') 


<^{QIpi 

dz' 


dz' = M, 


Ap^ 


Pi 


(151) 


In earlier derivations, another normalisation for p was used, Msz = M^/\A01 |399) . 
such that /SIi, « fJ-Mgz with p-parameter p « 1.401 Ap.y/pj\^. 

In addition, we can estimate the total temperature shift caused by the energy 
release process by simply computing the amount of energy that actually ther- 
malised: Apj/p,y\T « /p°°[l —J'(z')][d((5/p-y)/dz'] dz'. Putting everything together. 
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we therefore have 


A/y « Uy 


Ap^ 

-|- Gy 

<1 

P~i 

y 

P7 


P-y 


= 2/isz + M-Msz + Gy At- (152) 


Here, we introduced G* = 4Gy and At = AT/T « {l/T)Ap^/ p^\t. The different 
dehnitions for the spectral functions are historical and for the Green’s function the 
normalisation was hxed energetically, / Gy di^ = / My dv = / Yy di/ = p^{T)lA'iT. 

The approximations for both the distortion visibility function and the hybrid 
regime between the p,- and y-distortion can in principle be improved. One very 
simple extension is to estimate the branching ratios, Jy[z') and of energy 

into y and p, respectively |410j . This gives the approximations 


Ap^ 


f 

Pi 

y 

Jo 

<1 


[ 

P'f 


Jo 


d{Q/p^) 


dz' 


dz' 


for the effective energy release in the y- and p-era, with mni 

/ r , , 2.58\ 

Jy(z)^ 1+^ 


6 X 104 


Jy,{z) « 1 - exp - 


1 + z 
5.8 X 104 


1.88N 


(153a) 

(153b) 

(154a) 

(154b) 


To ensure full energy conservation (no leakage of energy to the r-distortion), instead 
one can use Jy,{z) « 1 — Jy{z) [see Fig. 17 . These expressions are accurate at the 


10% — 30% level for the concordance model. For higher precision, it is simpler to 
use the Green’s function method, so that we do not go into more detail here. 


5.2.2. Information in the residual (r-type) distortion 


The approximation, Eq. (152), only captures part of the full information stored by 


the spectral distortion, since it neglects the non-p/non-y contributions related to 
the r-distortion. This information can be accessed with future GMB spectrometers 
and provides valuable insight in the precise time-dependence of the energy release 
process, which then can used to constrain the scale-dependence of the small-scale 
power spectrum. But what is the new information that can be distilled from this? 
The total change of the CMB blackbody spectrum caused by a wide range of energy 
release scenarios can be expressed as 


AJy = y Ysz + p Msz + Gy At + R{v) , 


(155) 


where R{v) is the residual distortion contribution. Under real world conditions, 
this part of the distortion signal has no unique dehnition but depends on the ex¬ 
perimental settings such as the frequency range, number of channels, bandpasses. 
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Fig. 18. Residual distortion for different heating redshifts in the range 10“^ $ $ 3 x 10^. For 

the construction, we assumed instrumental parameters {z^min? t^max, Az/} = {30,1000,1} GHz and 
diagonal noise covariance (the figure is taken from l424l ~). 


and channel sensitivities [HU. These parameters (as well as contaminations by 
foreground components and systematics) determine the orthogonality (o how well 
different signals can be distinguished) of different distortion shapes and are thus 
crucial for the definition of R{v) with respect to /i, y and T. 

For single energy release at different redshifts, channels with width Av = 1 GHz 
in the range 30 GHz = r'min A "A ^^max = 1000 GHz and uncorrelated constant 
noise, A/c, per channel, the r-distortion is shown in Fig. |18| The typical amplitude of 
the r-distortion signal is largest at low frequencies {v < 60 GHz), reaching ~ 10%— 
30% of the total distortion signal m- Thus, a lot of the additional information is 
contained in this frequency domain. 

We now have a full description of the spectral signal for single energy release, 
consisting of specific parameters p = {/i,2/, At} and R{v,z)] however, what is the 
real information content of Rliz, z)7 Glearly, for a given experimental sensitivity, 
the differences between two cases R{v, zi) and R{v,Z 2 ) can only be explored if 
AR = R{u,Z 2 ) — R{v,zi) exceed the noise, A/c, at a sufficient level. Furthermore, 
for smooth energy release histories, the r-distortion is a superposition of signals from 
different redshifts. Thus, different energy release scenarios can only be discerned to 
a limited degree. 
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All this can be quantified more precisely using a signal eigenmode analysis [also 
known as principal component analysis (PCA)], which decomposes the residual dis¬ 
tortion signal into orthogonal/independent spectral components, = S^^\vi), 
in the different frequency channels, r'i, ranked by the degree to which they can be 
measured The details are not important here, but the PCA delivers a com¬ 

pression of the information from the multi-frequency spectral data, AA = AJ(r'i), 
to a number of new parameters, p = {^, y. At,/ ifc}, that characterise the distor¬ 
tion signal and its relation to the energy release history. The pk are the eigenmode 
amplitudes of the r-distortion signal, which can be expressed as 

= ( 156 ) 

k 

Typically one can expect to be sensitive to only a few of the /ifc, depending on the 
sensitivity of the experiment and the type of energy release. By construction, the 
Pk do not correlate with each other or any of the other spectral parameters and 
their error bars, Apk, can be estimated from the PCA. This is specific to the chosen 
experimental parameters, however, the constructed eigenmodes can also be used for 
other experimental settings although then the modes become correlated |424) . 

Thinking of the spectral signal, AA, as a A-dimensional vector in frequency 
space, this implies that 

= (157) 

i i 

For a given energy release scenario, the eigenmode amplitudes, pk, can thus be 
computed as simple dot product once the signal eigenmodes are determined. This 
greatly simplifies the parameter estimation and model comparison process |424) . 
For PIXIE-like settings, the eigensignal vectors, are part of CosmoTherm. 
We will use this method below to forecast the sensitivity of CMB spectral distortion 
measurements to features in the primordial power spectrum. 

5.2.3. What can we learn from the distortion signal? 

Different energy release scenarios cause different sets of signal parameters, pi. These 
distortion eigenspectra can be used for parameter estimation and model selection. 
To distinguish different models, their energy release histories have to differ at a 
sufficient level during the p — y transition era (10"^ ^ ^ 3 x 10®), where the 

produced r-distortion signal is largest. Models with m parameters can only be fully 
constrained if at least m distortion parameters can be accessed spectroscopically. At 
least one additional parameter is needed to select between two m-parameter energy 
release scenarios. 

In combination with measurements of the light element abundances, it is in 
principle possible to constrain At = AT/T (or equivalently the change of the CMB 
monopole temperature Tg) and thus the total amount of entropy production between 
BBN and today. For PIXIE-type settings, one can expect to reduce the uncertainty 
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of Tq from ~ ImK |387j to ~ 13 nK [95% c.l.. I424j . However, the constraints on 
post-BBN entropy production are currently limited to very large energy release at 
z >2 X 10® with Ap^/py ~ 0.01 [e.g.. 142011421) . so that generally one cannot learn 
as much from more precise measurements of Tq. 

As mentioned above, the value of the y-parameter will be dominated by the late¬ 
time distortion signal created by reionisation and structure formation. Although for 
a PIXIE-like experiment the uncertainty for the average y-parameter may decrease 
from ~ 1.5X 10“® |38fij to ~ 4x 10“® [95% c.l., US], the primordial contributions will 
be hard to separate in a model-independent way. Uncompensated atomic transitions 
in the recombination era may help us to refine the picture a little |432L 1433] . possibly 
allowing us to distinguish pre- from post-recombination y-distortions, but a more 
detailed discussion is beyond the scope of this review. For our purpose, the value of 
y will be interpreted as a foreground parameter and shall not be used to distinguish 
various energy release scenarios. 

We are therefore left with p' = {p,,pk} to learn about different energy release 
scenarios. For a PIXIF-like experiment the uncertainty of p, could decrease from 
~ 9 X 10-® [386| for COBF/FIRAS to ~ 3 x 10"® [95% c.l., Ell Ii24|. The possible 
detection limits for pi, p 2 and p 3 are ~ 3 x 10“^, ~ 2 x 10“® and ~ 7 x 10“®, 
respectively [424] . The value of p can be used as a proxy for the total energy re¬ 
lease, while the pk inform us about its time-dependence. It was, for instance, shown 
that decaying particle scenarios [e.g., I434H4371 for early discussions of distortion 
constraints] give rise to a different distortion eigenspectrum than annihilating par¬ 
ticles, so that these cases may be distinguished if a sufficient sensitivity is reached 
[39511424| . Similarly, one may be able to constrain the location and amplitude of 
features in the small-scale power spectrum, as we discuss now. 

5.3. Probing the small-scale power spectrum with distortions 

Spectral distortions could allow us to constrain the shape and amplitude of the 
small-scale power spectrum. The physics causing the distortion signal is very simple 
and just based on the fact that the mixing of blackbodies with different temper¬ 
atures does not produce a pure blackbody spectrum. In our universe, blackbodies 
at different temperatures are set up on various scales by inflation and the photon 
mixing process is simply mediated by Thomson scattering (no energy exchange re¬ 
quired!) at scales shorter than the photon diffusion length well within the horizon. 
In this section, we highlight how from the small-scale power spectrum of curvature 
perturbations, Pn{k) = 27r^fc“®U 7 ?,(fc), we can obtain the spectral distortion and 
corresponding constraints. 


5.3.1. Damping of small-scale acoustic modes 

Measurements of the CMB anisotropies cover a wide range of physical scales, start¬ 
ing from our horizon scale (wavenumber fc ~ 2 x 10“^Mpc~^) all the way to 
k ~ 0.2Mpc“^ deep inside the CMB damping tail (see Fig. 12). Parts of the CMB 
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sky that are separated by more than ~ 1° (or multipole I ~ 200) correspond to in¬ 
dependent regions of our universe, which at the time of recombination have basically 
not been in causal contact since the end of inflation. This characteristic angular scale 
corresponds to the size of the sound horizor p’^‘^| at recombination, ~ 150 Mpc, 
defining the location of the first peak in the CMB temperature power spectrum. At 
much larger angular separation, the observed temperature anisotropies still repre¬ 
sent the quasi-scale independent initial perturbations with small corrections due to 
the ISW effect. This is reflected in the nearly flat and featureless shape of the CMB 
power spectrum at ^ < 50 (see Fig. 12). In contrast, at smaller scales, the fluctua¬ 
tion power exhibit oscillations (e-)- acoustic peaks) which decrease in amplitude due 
to Silk damping caused by photon diffusion [314) . 

The CMB damping tail is now measured to high precision with ACT [43811439) , 
SPT |440) and Planck [44111422) . If we were able to undo the effect of Silk damping, 
we would find much larger fluctuations at small scales [e.g., see Fig. 7 of 1442) . The 
power stored by the initial perturbations was erased by the photon diffusion process 
and is equivalent to energy release [388H3901139311443) . Initially, this causes a y-type 
distortion which subsequently thermalises [3911144411392) . Physically, photons are 
simply mixed together by the isotropising effect of Thomson scattering (and free 
streaming after recombination), so that no energy exchange between matter and ra¬ 
diation is actually required to produce the distortion. Energy is just directly trans¬ 
ferred from the fluctuating part of the photon field (acoustic waves) to the uniform 
part, when comparing to an initial blackbody at the average CMB temperature. 

Without any more detailed derivations, it is thus already clear that larger small- 
scale power gives rise to larger distortions Furthermore, the shape of 

the small-scale power spectrum will determine the effective heating rate at different 
times and thus can be probed using spectral distortions [391113151144511446113951 
142411447) . The distortion also depends on the type of perturbations (adiabatic O 
isocurvature) [38911448H450] . although here we shall focus on adiabatic modes. If 
more small-scale perturbations are present in different directions, an anisotropic 
distortion could be created, allowing us to probe scale-dependent non-Gaussianity in 
the ultra-squeezed limit [45IH454] or possibly cosmic bubble collisions [397[ . Finally, 
also tensor and vector modes lead to dissipation; however, unless a very blue power 
spectrum is realised, the associated distortion signals should be sub-dominant [455F 

USD. 


5.3.2. Superposition of blackbodies and creation of distortions 

To understand the physics a little better, in Fig. we illustrate the superposition 
of blackbodies, B^{T), with two different temperatures, Ti and T 2 (equal weight), 
simply thinking about photons trapped inside a box. Initially, the photon distribu¬ 
tion at any location is a blackbody at a specified temperature. Thomson scattering 


ixxiv'pjjg comoving distance sound waves in the photon-baryon fluid traveled since inflation. 




















































June 22, 2015 


76 


Blackbody spectra Photon mixing Blackbody + y-distortion 




Fig. 19. Illustration for the superposition of blackbodies. We envision blackbody photons inside a 
box at two temperatures Ti and T 2 , and mean temperature = ^(^1 +^ 2 ) initially (left panel). 
Thomson scattering mixes the two photon distributions without changing the photon number or 
energy. The averaged distribution is not a pure blackbody but at second order in the temperature 
difference exhibits a y-type distortion in the Wien tail (right panel). 


converts the photon field to one uniform distribution, which afterwards is described 
by a blackbody at temperature Tb = (Ti + T2)/2 plus a y-type distortion |458H460] 


^ + ^ ^ 2G:(Tb) + lsz(rb, ly) y^up (158a) 

« Si,(Tb) + Yg^iTh, v) 2/sup (158b) 

« B^[T\,[1 + 22/sup]) + Vsz(Tb, u) 2/sup- (158c) 


Here, we defined the y-tjpe distortion function, Yg^^iTh, v) = lsz(rb, v) + 2 G* (Tb), 
and effective //-parameter, //sup = | (^2 — Ti)^/T|^ = lAT^/T^, for the two temper¬ 
atures Ti = Tb — AT and T2 = Tb -I- AT. The three ways of writing the averaged 
distribution are equivalent, but conveniently isolate the different limiting behaviours 
at low and high frequencies and the effect on photon number and energy density. 

We can see from Fig. that in the Rayleigh-Jeans part of the spectrum the 
difference between {I^) and B^{T\y) is extremely small, while in the Wien tail a 
distortion is clearly visible. The function yg* 2 (Tb, v) in Eq. (158b) captures these two 
extremes, vanishing for hv <C fcTb, but exhibiting a //-type dependence, F'g* 2 (Tb, v) « 
l^Sz(Tb, ly) « G*(Tb )^ for hiy kTp,. Computing the photon number and energy 
densities from Eq. (|158a), we find 


(N^) « iV^^Tb)[l + 6//sup] = lV“(Tb)[l + 3 ATVt2], 

(p^) « p^^Tb)[l + 12//sup] = P^\T^)[l + 6ATVt2], 


( 159 ) 
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respectively. Here, N}^^{T) oc and p^^(T) oc denote the number and en- 


.... 

ergy density of a blackbody at temperature T. We also used the integral relations 
4tt J Gt{n)di^ = 47r fYsz(n,i')diy = 4p^^(n), Jtt / Gt(n)diy/hiy = 3N^^(n) 
and J YsziThji') dv/hv = 0. 

Equation ( |159| ) shows that the number and energy densities of the averaged 
photon distribution are both a bit larger than expected from the mean tempera¬ 
ture, Tb- With respect to B^{Tb), an energy density of Ap^/p^^{Tb) ~ 6AT'^/T^ 
was added, while the number density changed by ANj/N^{Tb) « 3AT^/T^ = 
(l/2)Ap.y/p!^’^(Tb). This violates the blackbody relations for the number and energy 
densities [i.e., AN-y/N^{TY,) « (3/4)Ap.y/p!^’^(Tb)], indicating that the isotropised 
spectrum is indeed distorted. The relative energy stored by the initial y-distortion 
is Apy/p^{Tb) ~ 4j/sup — 2AT^/T|^ [cf. Eq. (158al], which is only 1/3 of the total 
energy contributed by the fluctuating part. The other 2/3 of energy directly raise 
the temperature of the uniform blackbody part, i.e., —)■ B^(T\y[l + 2j/sup]) 

in Eq. (158c), and thus do not source a distortion |391) . 

Once the photon field isotropised by Thomson scattering, the usual thermal- 
isation process starts, attempting to redistribute and replenish photons, convert¬ 
ing ?/ —>■ /r —^ AT. After thermalisation completes, in total AAt^/At^(Tb) « 
(l/4)Ap.y/p!^’^(Tb) « (3/2)AT^/r,^ of photons were produced by DC and BR emis¬ 
sion. This is exactly the right number of photons required to fully thermalise the 
^-distortion part of the uniform photon distribution and to obtain a final black¬ 
body at temperature T' = Tb[l -I- (3/2)AT^/T|^] following from the total energy 
conversion, Ap.^/p^ 


(Tb) ~ dAT^/Tl 


5.3.3. Computing the effective heating rate 

In our universe, a spectral distortion is created by the superposition of blackbodies 
of different temperature due to Thomson scattering. One additional aspect we need 
to consider more carefully is the scale at which dissipation occurs. At recombination, 
photons can freely travel over distances shorter than the Thomson scattering mean 
free path, At — l/cxA^e — 0.013Mpc (at z ~ 1100), which is ~ 10"^ smaller than 
the sound horizon at that time. At scales At ^ A < Ah (with Hubble horizon Ah — 
y/Srs ~ 260Mpc), photons perform a random walk, traveling a typical distance 
Ad — V Age At. The number of scatterings, Ngc, can be estimated by comparing the 
comoving distance a photon could have traveled in a Hubble time with the mean free 
path, such that at recombination the diffusion scale is Ad — — 2 Mpc |442) : 

this is about 75 times smaller than r^. A more detailed estimate gives Ad — 7Mpc 
or ^D = AJ/^ ~ 0.14Mpc~^ (see below) for the damping scale at recombination, 
but the qualitative picture does not change. At earlier times, the damping scale 
was even smaller and perturbations at fc > A^d were efficiently erased sourcing CMB 
spectral distortions and starting the thermalisation process. 

A detailed treatment of the dissipation problem in second order perturbation 
theory was carried out by [391j . There the physical picture for the creation of 
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the distortion was clarified, demonstrating that the total energy conversion is 9/4 
larger than from the classical estimates |388H390] and that only 1/3 of this energy 
actually sources a distortion, initially appearing as y-type distortion. The explicit 
expression for the effective heating rate from scalar perturbations, including second 
order effects, gauge dependence and all photon multipoles, reads |391L 144411450] : 


d(Q ac /Pt) 

dz H 


dk 

2tt'^ 


Pn{k) 


(301 - v)^ 
3 



--02 (00 + 02 ) + ^( 21 ' + 1 ) 0 ? 

e>3 


(160) 


where f = a^NeC ~ 4.4 x 10“^^(1 + z)^ sec“^ is the rate of Thomson scattering, a = 
l/(l+z) the scale factor normalised to unity today and H « 2.1 x 10“^° (l+z)^sec“^ 
the Hubble expansion rate Here, 0^ and 0? denote the photon temperature and 


polarisation transfer functions and v the one for the baryon velocity. Additional 
corrections from polarisation terms have been neglected, but do not contribute at 
any significant level during the tight-coupling era of interest to us |457j . 

It was shown that in the tight-coupling regime {z > 1000), the temperature 
quadrupole anisotropy is most important |391j . Effects of baryon loading, bulk flows, 
and photon multipoles with £> 2 can be neglected. This greatly simplifies the 
expression for the effective heating rate, for which only the transfer functions for 
02, 00 and 02 , determining the shear viscosity of the photon fluid |46HI462j . really 
matter. These transfer functions can be computed using standard Boltzmann codes, 
like Camb |463j . and are also included as part of CosmoTherm. For a single mode, 
these oscillate rapidly, so that the heating rate is a function of time [e.g., Fig. 1 of 
reflecting the fact that dissipation is effective at different phases of the wave 
propagation. Dissipation is furthermore most effective where the gradients of the 
temperature field are steepest. 

Using the tight coupling approximation [464] . one can write t' 02 ~ 
where t' = ra/c and 0g -|- 02 — | 02 , so that 




d(Qac/ P-y) 

dz 


4aT 

'1^ 


'k'^dk 

2tt'^ 


P, 


K 


( fc )^02 


4c^ 

Hot., 


• dfc 

27r^ 


k^Pn{k)^Ql (161) 


15 


For adiabatic initial conditions, one has 0i(2:, k) k, A (cg/c) sin(A:rs) e“^ ^ where 

Cg « c/y/i is the sound speed, Vs — f Cgdr/ « CsP « 2.7 x 10^(1 -I- z)~^ the sound 
horizon and A ~ [1-|-(4/15)i?,y]“^ ~ 0.9 with neutrino loading = Pu/{Py+Pv) — 
0.41. Neglecting baryon loading, the damping scale, ku, follows from first order 


ixxvrpj^g approximations for f and H are only valid during the radiation-dominated era. 
lxxvi]vjQ^-g Q 3 J. definition = (2£ -|- 1)©^. 
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perturbation theory and is determined by [e.g., see 14641 [465] 

„ , _9 cl 16 8c^ 

Otk 


'tft-D 


2a^f 15 450^7"’ ( ® ) 

where the factor 16/15 includes polarisation effects [462j . In the radiation dominated 
era, one has « 4.0 x 10“®(1 + Mpc“^. Inserting this into Eq. (161), we find 

d(Qac//07) 4^2 


dz 


c; 16 f ,,, . o/, \ - 2 k^/k^ 


-AA 


/ ^Pnik)sin\kn)d,e-^'^"^^i. (163) 

From this it is straightforward to show that for a single mode, most of the energy 
is released at redshift Zdiss — 4.5 x 10® (fc/10®Mpc“^)^'^^ [see. 1315) . Thus, in the 
y-era (z < 10^), modes with wavenumber k ^ 3Mpc~^ dissipate, while in the /i-era 
(3 X 10® < z < few X 10®) we are sensitive to 544Mpc“^ ^ k < few x lO'^Mpc”^. 
For 3Mpc“^ ^ k < 544Mpc“^, the hybrid (y + y + r) distortion is produced, 
allowing us to probe the shape of the small-scale power spectrum. For modes with 
k ~ 45Mpc“^ about half of the energy causes y and the other half a /r-distortion. 

Assuming smooth Pji^k), we can neglect the variation of sin^(fcrs) and simply 
use the time-averaged value ~ 1/2. In this case, the effective heating rate becomes 


d((5ac/ P-y) 


-2A^4- 

dz , 


dfc 

277^ 


Pnik) e 




A'^ 32c2 
Wa ^ 


' f dk ,4,-, ... 

¥ J ■ 


(164a) 


(164b) 


The minus sign is due to the fact that for decreasing small-scale power energy is lib¬ 
erated. With this expression, we can approximate the effective heating rate for adi¬ 
abatic perturbations given the curvature power spectrum P-ji^k). For isocurvature 
modes, a mode-dependent heating efficiency and fc-space weighting, A^ C^ik), 
has to be added |450j . but here we do not consider isocurvature modes in more 


detail. Using the Green’s function method (Sect. 5.2), with Eq. (164) we are thus 


able to compute the distortion and perform parameter estimation. 


For simpler estimates, one can insert Eq. (164) into Eq. (153) to compute the 


effective energy release in the p- and y-eias numerically. Approximating the energy 


branching ratios, l7i(z) in Eq. (154), as step-functions, one can introduce Ic-space 


window functions to directly approximate the p and y-parameters. This approach 
was used by |315| and later refined in |450] . yielding 



Pn{k)W^{k) 

Pn{k)Wy{k) 


(165a) 


(165b) 
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(166a) 


(166b) 

with k = /c/[lMpc“^] and cutoff scale, /cmin — IMpc”^. The cut-off scale is in¬ 
troduced both because modes at fc < lMpc“^ are already tightly constrained by 
CMB measurements at large scales, implying only very low extra heating to cre¬ 
ate ^-distortions, and because the analytic approximations for the photon transfer 
functions introduced above become inaccurate. 


for adiabatic perturbations, where the /c-space window functions are |450j 
Wf_i{k) « 2.8 exp I — - 
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5.3.4. Simple derivation based on energetics 


We can obtain the approximation for the effective heating rate in another way. 


Equation (164a) simply represents the redshift derivative of the total power inte¬ 


gral, modulated by the damping function. From the discussion in Sect. |5.3~^ we can 
write the average energy density of the photon field as « p^{T)[\ + 6(0^)], 
with the average CMB temperature T = (T), 0 = AT{z,x,n)/T and where (...) 
denotes an universal average over location x and directions n. Then we have the 
total energy density, Q/p^{T) « 2(0^), in the waves that can source distortions, 
and therefore dziQac/P-y) — —‘2'dz{&^)■ If we rewrite 0 as spherical harmonic 
expansion and average over directions under the assumption of isotropy, we find 
(0^) « (0§)-|-3(0f) « (|0oP). Here, we used that 0o and 0i are 7r/2 out of phase 
and that 3|0ip « |0oP- In Fourier space, |0oU — Ae~^ and by comparing 


with Eq. (164a), we can directly identify (0^) ^ f k^P-ji{k)e These 


simple arguments do not capture the full time-dependence of the damping process 
for a single mode (fast oscillations), but give the correct time-averaged heating rate. 
More carefully, one would find dz{Qa,c/P-y) — —29^(0^) = —4(0920), which using 
the expressions for dzQ to first order in perturbation theory again recovers the full 
time-dependence [see Sect. 5.1 of. 1391) . Here, 920 only includes scattering terms 
and the full gauge-independence is restored by also adding second order scattering 
corrections. For general scalar, vector and tensor perturbations, this was recently 
shown by |457j . 


5.4. Constraints on the small-scale power spectrum 

We now have all the ingredients for computing the spectral signal caused by the 
dissipation of small-scale perturbations. Before discussing possible constraints on 
features, we briefly highlight those on the power spectrum, using the standard 
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parametrisation |466j 

Pnik) = 27r2fc-3y4(;(fc//co)”""^+5"™in(fc//co)^ (^07) 

Here, we have the power spectrum amplitude spectral index ng, its running 
rirun = dng/ dlnfc and pivot scale fco- Features are added as a perturbation around 
this background power spectrum and the important question is whether with spec¬ 
tral distortions we could tell the difference. As we argue below, CMB spectral dis¬ 
tortions allow us to put interesting constraints on features caused by particle pro¬ 
duction models, while for oscillatory features the average signal remains too close 
to the signal produced by a featureless background power spectrum. 

5.4.1. Standard power spectrum 

Constraints on the small-scale power spectrum for power law initial perturbations 
(’^run = 0) using CMB spectral distortions were first discussed by |389) . |390j and 
|448) . both for adiabatic and baryon isocurvature modes. Using the early distortion 
limits from COBE/FIRAS and the amplitude of the CMB anisotropies measured 
with COBE/DMR at large scales, is was argued that ng < 1.6 for adiabatic modes 
[MT] . At that time, this was one of the tightest constraints available! 

Those early estimates were based on simple approximations for the energy re¬ 
lease and it was argued that an improved treatment of the dissipation process was 
required to forecast experimental possibilities for PIXIE-like experiments |393j . The 
discussion was extended to cases with running by |443) , but without taking the de¬ 
tailed physics of the dissipation process into account developed later |391j . It was 
shown that a PIXIE-like experiment could detect the /i-distortion at about ~ 1.5tT, 
if the power spectrum is extrapolated from large to small scales assuming ng = 0.96 
and nrun = 0 m- On the other hand, for an experiment similar to PRISM (basi¬ 
cally 10 times the spectral sensitivity of PIXIE) the distortion would be detectable 
at the level of ~ 15tT, which would provide an additional confirmation for the sim¬ 
plest inflation model all the way to A: ~ 10"^ Mpc“^! Simple expressions to estimate 
constraints on A^, ng and n^un from fi and y-distortions are given in [450] for both 
adiabatic and isocurvature modes. 

Generally, additional information from CMB distortions on the standard power 
spectrum (extrapolated all the way to small scales) does not improve the constraints 
on and ng over the CMB anisotropy measurements [e.g., see Fig. 11 of l424j . unless 
much larger sensitivity (> 100x PIXIE) is reached [395]. However, the long lever arm 
between large-scale CMB and the small scales relevant to spectral distortions can 
help improving the limits on running over CMB anisotropies alone [4451144611424] . 
For nrun — 0, the constraint can be improved about ~ 1.7 times when adding CMB 
spectral distortions to future anisotropy measurements expected from an imager 
similar to PRISM, while for nrun — —0.02 the improvement is ~ 20% [4241129] . 

However, from the experimental point of view the small-scale power spectrum is 
much less constrained and a simple extrapolation from large to small scales involves 
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Fig. 20. Icr-detection limits for //, //i, and /i .3 caused by dissipation of small-scale acoustic 
modes for PIXIE-like settings. We used the standard parametrisation for the power spectrum with 
amplitude, spectral index, ng, and running n^un around pivot scale ko = 45 Mpc“^. The heavy 
lines are for rirun = 0, while all other lines are for Urun = {—0.1,0.!} in each group. For reference 
we marked the case rirun = 0.1. The figure is taken from |424| . 


(more or less well motivated) theoretical prejudice. At 3Mpc“^ ^ ^ ^ lO^Mpc”^, 
from the experimental point of view [e.g., 14(17] there are at least two orders of 
magnitude of wiggle-space relative to large-scale constraints. It is thus interesting 
to consider distortion bounds on the small-scale power spectrum independently 
[e.g.. 13151139511424] . Using the standard parametrisation, Eq. (1671, for the power 
spectrum around a pivot scale ko ~ 45Mpc, the Icr detection limits for /x, /xi, 
/X 2 and ^3 are given in Fig. assuming PIXIE-like settings, idealised foreground 
removal and control of systematic effects. Around ns — Ij a detection of ^ is possible 
for A(^ > 10 “®, while > 6 x 10 “® is necessary to also have a detection of /xi. 
In this case, two of the three model-parameters can in principle be constrained 
independently. To also access information from /X 2 and /X 3 one furthermore needs 
> 10“^. In this case, one could expect to break the degeneracy between all three 
parameters, p = {A,j, ns, nrun}, with a PIXIE-type spectrometer. This implies that 
for excess small-scale power, CMB distortion measurements allow placing unique 
and complementary constraints on different early-universe models. A spectrometer 
similar to PRISM could further lower these detection limits, possibly by one order 
of magnitude |424| . 
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5.4.2. Features in the small-scale power spectrum 


Let us now turn to the question of whether distortions can help us to shed light 
on features in the small-scale power spectrum. A broad discussion of non-standard 
small-scale power spectra can be found in |315) and [447] . Here, we focus on fea¬ 
tures caused by particle production models and changes in the sound speed. The 
former scenario primarily leads to localised bumps in the power spectrum [14511147] . 
whereas the latter primarily results in oscillatory modifications (Sect.^. 


For particle production models, [145] provided a simple parametrisation for the 
resulting bump in the primordial power spectrum which we will utilise in the dis¬ 
cussion that follows: 

3/2 

(168) 


/ 7re 

^Pn,i{.k) = 2tt‘^ exp 


TT 

'2B 


From Fig.]^ we see that this simple parametrisation accurately models the generated 
features calculated using heat kernel methods in the regime where back reaction on 
the background fields are negligible (cf. Sect.and Appendix B|. The amplitude of 
the feature, j, is simply related to the value of coupling constant pi between the 
two fields; ~ 1.01 x The derivation for this feature in the primordial 

power spectrum is only valid for 10“^ ^ 9i ^ 1 m. so we are primarily interested 
in i values between 10“^® and 10“®. There are no restrictions on the location of 
the bump; ki is determined by the number of e-foldings between the moment when 
the field becomes massless {(/> — (j>i) and the end of inflation. 


Inserting Eq. (168) into Eq. (164), we find the approximation [315] 


d(Qp,i/ Pj) 

Tz 


9.4a 


e3/2 

•\/67r 


{ki/k-oy 


1 + 


5/2 


(169) 


for the effective heating rate. Examples for two particle production scenarios are 
shown in Eig. The localised particle production feature in the small-scale power 
spectrum causes a burst of energy release at Zp ~ 5.1 x 10® (fci/10®Mpc“^) 
This means that for models with 3Mpc“^ ^ ki < few x 10^ Mpc“^ one can expect 
to be able to constrain both the position and the amplitude of the feature using 
information from the r-distortion. COBE/FIRAS constraints are too weak to limit 
the parameter space, since physical conditions already imply ^ < 10“® [315] . 
However, for PIXIE-like settings one can expect to strongly improve the situation. 

In Fig. we show the expected Icr detection limits for PIXIE-like settings. 
For 80Mpc“^ ^ ki < 3000Mpc“^, the detection limit from p is A^ i ~ 6 x 10“®, 
while in the range 3Mpc“^ ^ h < 250Mpc“^ one may be able to determine both, 
A^,i and ki, if A^^i > 10“^. At scales k > 10®Mpc“^, only p may be detected if a 
particle production feature is present, but the detection limits become weak beyond 


ixxvii Although in the limit of very slowly varying sound speeds, a net injection of power resulting 
in a bump is also possible (cf. Fig.j^. 
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Energy Release for the Standard Power Spectrum with a Bump 



Fig. 21. Effective heating rate for the standard power spectrum with a feature caused by particle 
production during inflation at ki = {20, 400}Mpc~^. For illustration we chose (ns,nrun) = (0.96,0) 
for the background spectrum. Furthermore, we set = 3.5 x 10~^ in both shown cases. The 
figure is taken from EE]. 


k ~ few X lO^Mpc”^. A PRISM-like spectrometer could lower the detection limits 
by a factor of ~ 10, thereby also widening the range over which both A(^^i and ki 
could be constrained individually. 

Another question is whether a small-scale power spectrum with and without 
particle production feature can be distinguished. For this we have to consider the 
shape of the distortion, which is determined by the distortion shape parameters 
Pi = (/^i/A/ri)/(/i/A/i). The ratios Vi = pi/p simply eliminate the dependence on 
the amplitude of the distortion, while weighting by the errors directly gives a sense of 
the sensitivities [424]. Thus, combinations of pi and p 2 define a unique trajectory in 
parameter space which can be compared to the one of the standard power spectrum 
without a feature. This is illustrated in Fig. which compares these cases. The 
eigenspectra of particle production scenarios differ from the simple power law case. 
Thus, a particle production feature at scales 20Mpc“^ ^ h ^ 200Mpc“^ could be 
distinguishable for PIXIE-like settings, if the amplitude exceeds ~ 10“^. 

For comparison, we also show the signal caused by a decaying particle scenario. 
From Fig. 5 of [395] we can see that the shape of the energy release history is 
very similar to the one of a particle production feature. Fig. |21| Consequently, the 
distortion signal is expected to be quite similar, a fact that is reflected in Fig. 
Thus, distinguishing a decaying particle scenario and a power spectrum feature due 
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Fig. 22. l(j-detection limits for {i, fii and on a power spectrum feature caused by particle 
production models for PIXIE-like settings. The figure was obtained with the Green’s function 
method |410| of the CosmoTherm package |393| . 


to particle production will be challenging. However, ultimately this is a question of 
sensitivity and how many distortion parameters can be measured. 

Finally, we briefly consider oscillatory features in the small-scale power spec¬ 
trum, for example, created by changes in the sound speed. As already mentioned 
in mM, the energy release caused by oscillatory power spectrum features usually 
causes a small average effect. For the net energy release, power enhancements and 
deficits cancel out. In particular, for features with rapid oscillations only the aver¬ 
age energy release history over several periods will cause a change of the distortion 
signal with respect to the standard featureless power spectrum. 

One rough estimate for the sensitivity to oscillations in the energy release his¬ 
tory caused by oscillations in the small-scale power spectrum can be found from 
the typical width of the energy release eigenmode functions, determined in |424) . 
Even if the first four r-distortion eigenmodes could be constrained, features in the 
energy release history that are narrower than Az/z ~ 0.2 — 0.3 in redshift, will 
not be resolved. This corresponds to Ak/k ~ 0.2 — 0.3 around k ~ 45Mpc“^ for 
oscillatory features in the power spectrum. For features with higher period, indi¬ 
vidual oscillations will leave an unobservable net distortion signal, unless power is 
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Fig. 23. Trajectories in the pi — p 2 plane for different models. The black line shows the trajectory 
for a feature caused by a particle production scenario for varying ki. For comparison, the red 
dashed line shows a power law spectrum (no running) for different spectral indices, ng, while the 
blue dotted line gives the trajectory for decaying particle scenarios and varying particle lifetime 
tx — [4.9 X 10^/{1+zx)]^ sec. Particle production features produce a distortion that is very similar 
to the one of decaying particles but distinguishable from a normal featureless power spectrum. 


modulated with a longer wavelength. However, given the large parameter space, a 
more detailed study will be left for the future. 


6. Statistical analysis 

Having introduced the most relevant observables, let us now turn to reviewing some 
basics of the statistical analysis of the data. How can one actually tell whether a 
feature is “required” by the data, and how would one go about looking for them? 

The starting point of this discussion is the basic quantity delivered by an exper¬ 
iment measuring an observable O (e.g., the CMB angular power spectrum) - the 
likelihood function /^(Dobsli^th), be., the conditional probability of O taking on the 
observed value Dobs, under the premise that the true value of D is given by Dth- 
In general, Dth is the prediction of a theoretical model Ai for given values of the 
free parameters 6 of A4, and thus we can write the likelihood as a double condi¬ 
tional probability £(Dobs|Dth(0|Af ))■ It should be noted that £ is a function of the 
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data Dobs, not of the theoretical prediction Dth(^l-Ad), so taken just by itself, it 
cannot be used to distinguish between different models (or even different parameter 
values within the same model). 

6.1. Frequentist or Bayesian? 

Given the likelihood function, one now has two approaches to making quantitative 
statements about potential signatures of features in the data. Their fundamental 
difference lies in what quantity is to be treated as a random variable. One can either 
take the functional dependence of the likelihood at face value and treat the data as 
the outcome of a random process (we will refer to this as the Frequentist method), 
or one can take the actually measured data as a condition and relate the likelihood 
to a probability distribution where the theoretical prediction is a random variable 
(Bayesian method). 

Though the issue of which way to follow is sometimes the subject of ideological 
debates among practitioners in the field, we do not presume to take a side here. 
In our opinion, both methods are valid and valuable items in the data analyst’s 
toolbox, provided they are implemented correctly. 

6.1.1. Frequentist approach 

Applied to the problem at hand, the Frequentist approach can be motivated by the 
following line of thought: even if the primordial power spectrum realised in Nature 
was smooth, realistic data will scatter around the theoretical expectation values. 
It is therefore always possible to engineer a more complex (features-)model that 
will fit the data better than a smooth spectrum. Assuming the true spectrum was 
smooth, one can now ask by how much a particular features-model would typically 
improve the fit. And unless the improvement observed for the real data exceeds the 
expectation value by a sufficiently large amount, one should be rather careful about 
claiming a discovery of a feature. 

More formally, the frequentist approach requires the formulation of a null hy¬ 
pothesis iJo (e.g., “the true power spectrum does not have any features”), and an 
alternative hypothesis (e.g., “the true power spectrum has features, described by 
model Ml”), along with the definition of a suitable test statistic 6 (e.g., the likeli¬ 
hood ratio between the best fit featureless model and the best fit of features-model 
All). Then, random realisations of the data can be generated under the assump¬ 
tion that the null hypothesis is true. On each of these random realisations, the test 
statistic will be evaluated, resulting in an estimate of the probability distribution 
P(6). By comparing of P{&) with the actually observed value ©obs, one can ex¬ 
tract the fraction of simulations which yield a more extreme test statistic © > ©obs 
(or © < ©obs), known as the p-value, and gives a qualitative indication whether or 
not the data prefer a feature. 

Unfortunately, p-values are often misinterpreted (see for instance Ref. |468j for 
commonly encountered misconceptions). Let us in particular emphasize that the p- 
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value is neither the probability of the null hypothesis being wrong, nor is 1 — p the 
probability of the alternative hypothesis being true. Also, the widely-spread prac¬ 
tice of calling p < 0.05 “significant” evidence against the null hypothesis is rather 
dubious mi]. In addition, great care needs to be taken when selecting the data 
and defining the test statistic: both choices can bias the p-value and ideally ought 
to be decided upon before having seen the actual data. Focussing on a particularly 
anomalous subset of the data after having inspected them (“a posteriori reasoning”) 
or considering several independent alternative hypotheses (“look-elsewhere effect”), 
for instance, can easily lead to artificially low p-values and premature claims of a 
feature detection. 

Issues like these, and the fact that one has to perform the already numerically 
demanding fitting/likelihood maximisation procedure not only once for the real 
data, but also for a large number of simulated data sets, have given the alternative 
Bayesian ansatz a slight edge in popularity in features searches. 


6.1.2. Bayesian approach 

The idea behind the Bayesian approach is to promote the likelihood function to 
a probability density over the parameter space of a model, or, going even further, 
over a space of different models (see |470j for a detailed discussion of Bayesian 
statistics in the context of cosmology). This goal can be achieved with the help of 
the following relation for conditional probabilities, known as Bayes’ theorem: 


P{B\A) 


P{A\B) ■ P{B) 


(170) 


In a first step, one can apply Bayes’ theorem to the likelihood function and 
obtain the posterior probability density V, 


iP(Dth(e|Af)|£)obs)cl0 


p(0|M)d0-/:(Dobs|Dth(g|Af)) 

f(Dobs|X) 


(171) 


Here, the prior probability density p{6\A4) represents our knowledge about the pa¬ 
rameters independently of the data The Bayesian evidence £ is basically the 


normalisation constant of V and given by an integral of the prior weighted by the 
likelihood over the full parameter space of A1: 


£:(Dobs|7W) = j AO p{e\M)L{Ooi,s\OMM)). (172) 

Invoking Bayes’ theorem a second time and applying it to the evidence yields the 


Ixxviiirpjjg 

prior probability density should encode, e.g., theoretical limitations on parameters, or 
knowledge about the parameters from previous independent measurements. In most cases, however, 
there is no unique canonical prior on 0, and its choice will involve a fair amount of subjectivity. 
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probability of A4 given the data: 


P(M|Dobs) 


p(M)-£(OobslM) 

P(Oobs) 


(173) 


In order to compare two different models A4o and A4i, it is useful to look at their 
model probability ratio, also known as the Bayes factor, 


_ P(Afo|i?obs) _ '£’(ilobs|Alo) 
= P(MilDobs) “ f(OobslXi)’ 


(174) 


where the second equality assumes that the two models have been assigned equal 
model prior probabilities. The Bayes factor has a very straightforward interpreta¬ 
tion: “given the observed data and priors, AJq is Poi times more probable than 
All”. Unlike a likelihood-ratio-based frequentist analysis, it also has the desirable 
property of naturally implementing Occam’s razor principle, due to the averaging 
process in the evaluation of the evidence. This rewards predictive models that yield 
a good fit over large parts of their parameter space and punishes models that are 
overly complex and “can fit anything” and thus provide a good fit over only a small 
fraction of their parameter space. 

Clearly, any physical interpretation of Bayes factors must keep in mind the 
subjectivity of having to specify the prior probability distributions of the model 
parameters. In particular, this also applies to the support of the priors. Whereas in 
Bayesian parameter estimation, parameter constraints are often not very sensitive 
to the range of a parameter covered by the prior, this is not the case for Bayes 
factors. Choosing too wide priors on the parameters of features models will typically 
disfavour them, but Bayes factors can also be pushed in the other direction by 
selecting (too) narrow priors around regions of parameter space with high posterior 
probability. Now this would typically require knowledge of the data, and while it 
is legitimate in the Bayesian approach to adjust priors a posteriori, one should still 
be mindful to not wilfully bias the outcomes in this fashion. 

Finally, the numerical evaluation of the Bayesian evidence can be a difficult 
problem in higher-dimensional parameter spaces. In practice, the nested sampling 
algorithm |471] offers an efficient solution, and has been implemented and refined for 
applications in cosmology in the MultiNest [47211473] and PolyChord [474] software 
packages. 

6.1.3. A note on the use of “sigmas” 

It is not uncommon to find results of a search for features quoted in terms 
of a likelihood ratio or effective between the two models under considera¬ 
tion, sometimes even mapped to the “number of sigmas”, 

V^xis- This is obviously motivated by the case of a 1-parameter Gaussian likeli¬ 
hood/posterior, in which case it can be straighforwardly related to a p-value or the 
Bayes factor, respectively. In multi-dimensional parameter spaces, or when facing 
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non-Gaussian likelihoods, no longer has this straightforward interpretation 

and applying the common intuition formed by the simple case is likely to vastly 
overestimate evidence for the better-fitting model, as we shall demonstrate in the 
example scenario below. 


6.2. A simple example 

Let us illustrate the application of the Bayesian and Frequentist analysis techniques 
with the help of a simple, yet, from the point of view of features-searches, perhaps 
not entirely unrealistic example. 

Consider a data set S, consisting of Np data points Xi, each drawn randomly 
from a Gaussian normal distribution Gp,^a{x) with mean p = 0 and standard devi¬ 
ation (7 = 1. Let us now compare the performance of two theoretical models, the 
“smooth” A4o, and the “features” Afi, defined as: 

Model AJq: 

Predicts Xi = 0 for all i € {1, ...,lVp} and does not have any free parame¬ 
ters. 


Model Ail'. 

Predicts a discrete feature x{i) = Xf ai i = if, and Xi = 0 for the remaining 
JVp — 1 values of i. This model has two free parameters, the position of the 
feature, q, and its amplitude, X[. 


The likelihood functions are simply given by 

Afp 

—21n£o = 

i=l 

iVp 

—21n£i = '^x(i)^ — x(i{)^ -|- (x(if) — X{)^. 


(175) 

(176) 


The ratio of the likelihoods is maximised for Xf = x(inia.x) and i[ = i^ax, where imax 
labels the x with the largest absolute value, and thus we have = a:(tmax)^- 

Of course we know from the construction of S that Aig is the true underlying 
model and that Aii will achieve a better fit to the data only by overfitting the 
natural scatter. Nonetheless, one might want to analyse nnder what conditions Aii 
would actually be preferred. 


6.2.1. Frequentist analysis 

With the null hypothesis that AJq is the correct underlying model and as 

the test statistic, we can generate a large number of simulated data sets Sj and 
construct the probability distribution of the test statistic. This gives us an idea of 
what the typical Ay^ax there is no feature in the underlying model, and also 
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allows us to quantify how large the would have to be in order for us to 

be able to “rule out” the null hypothesis of the smooth model at a given level of 
confidence. 

In this particular case, the probability density function describing the distribu¬ 
tion of can be derived analytically (see Ref. |.J79j for a similar example). 

Noting that the for a single Xi follows a chi-squared distribution with one de¬ 
gree of freedom, /^ 2 , the probability of the maximum Ay^ to not exceed a value of 
y (i.e., the cumulative distribution function of Ay^g^,^) can be expressed as 

P(AyLx <y) = (^(Ay" < (177) 

/ ry \^p 

= [I dy7x?(y')j (178) 

= (erf(vy72)j , (179) 


and thus the probability distribution function of Ay^g,; is given by 


PDF(y,iVp) = rerf(v772 

y'2TTy \ 




(180) 


In general however, the probability density function describing the distribution of 
Ay^ax cannot be so easily derived analytically and will have to be established 
through simulations. For independent random variables Xi drawn from identical 
distributions, and Np > 0(10), it can be reasonably well approximated by a gener¬ 
alised extreme value distribution |475L I476j , 


GF;Rp,.,7Ay^gx) = - 1 + e 


-AyLx-I^ 


-(€+!)/€ r / Ay 2 

^A.max 


exp 


- i + e 


(181) 

with 1 -h ^ (Axis) / (T > 0, but the free parameters of the distribution need to be 
obtained by fitting to the results of a simulation. 

As an example, we plot in Fig. 24 a histogram of Ay^gx obtained from 10® 
simulations, for Np = 1000. Here, the expectation value is (Ay^gx) = 11.9 and the 
97.725%- and 99.865%-quantiled™^ are at Ay^gx = 17.9 and 23.3, respectively - 


much larger than the naive attribntion 2a 


Axlfi = 4 and 3cr 


AXeff = 9 (see 


the discussion in Section 6.1.31. Following our procedure, this case can in fact be 


recovered, but only if A^p = 1 (and using the 1-tail limit definition). 

Increasing the number of data points in the data set will, on average, lead to 
more extreme outliers; therefore it is not surprising that the larger A^p, the higher the 
expectation value of the distribution of and the higher the threshold value 


ixxixpor a Gaussian normal distribution, these quantiles are at a: = /^ -h 2cr and a: = /r -)- 3(T, so we 
will refer to them as the (2-tail) “2 (t”- and “3cr” limits. If one has to deal with a distribution that 
only has one recognisable tail, it may be more reasonable to use a 1-tail definition of these limits 
instead, with the 2cr- and 3(j-limits marked by the 95.45%- and 99.73%-quantiles. This choice is 
motivated by the half-normal distribution, where these quantiles are at a: = 2 (t and x = 3(t. 



























June 22, 2015 


92 



Fig. 24. Histogram of A^max A^data = 10® simulated data sets (Dj with A^p = 1000 data points 
each. The thick black line is the theoretical probability distribution function (Eq. |180[ | and the 
dashed golden line is a fit of a generalised extreme value distribution with parameters (i = 10.83, 
cr = 1.85 and ^ = 0.0096. 


of Axmax required for rejecting the null hypothesis at a given level of significance. 
As can be seen in Fig.[^ both the threshold values and the expectation value grow 
to approximately linearly with In Vp. 

The way we defined the test statistic above, i.e., evaluated over the full 

data set, properly takes into account the look-elsewhere effect. But we can also use 
this scenario to illustrate the danger of a posteriori reasoning: imagine one were 
to first identify the most extreme Xi and then ask how likely it is to have such a 
large deviation from the prediction of the null hypothesis at this particular point. 
One would naturally find a p-value very close to zero, apparently disfavouring the 
null hypothesis. However, choosing to look at the most extreme point in isolation 
is obviously motivated by the fact that one has already seen the data, whereas in 
a blind analysis there would no reason to single out this Xi. So the fallacy here lies 
in ignoring all the other points; one should not be asking about how likely it is to 
observe such a feature at this specific point, but rather how likely it is to see any 
feature at all in the entire data set. 

6.2.2. Bayesian analysis 

Let us assign equal model prior probabilities to Ado ^-nd A^i, and assume the 
following parameter prior probabilities for Adi: a uniform prior on if (i.e., P{ii) = 
1/Vp for all If € {1,..., Ap]}, and a symmetric top hat prior of width /? on Xf (i.e.. 
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Fig. 25. Threshold values and mean of the distribution of Axmax ^ function of the number 
of data points A^p of the data set X) for the frequentist analysis. The golden line indicates the 
expectation value of ^Xmayi- The black and red lines mark the at which the null hypothesis 

is ruled out at 3c7 and 2cr, respectively. Thin lines denote the 1-tail definition, thick lines the 2-tail 
definition of a. If Np > 10, the distribution of the ^Xma:K negligible volume near ^Xma^c — 
and thus the 2-tail definition of cr should be used. At A^p < 10, one might want to prefer the 1-tail 
definition instead; in particular for A^p = 1, this reproduces the naive thresholds of Axmax = ^ for 
2a and Ax^ax “ ^ 


P{if) = 1//3 for if G [—/3/2,/3/2] and P{xf) = 0 if |(xf)| > /?/2. Then the Bayes 
factor between models 0 and 1 is given by 


^01 = 




^ dxG^^iixi) 


p i= 


2=1 


Go,iixi) 


(182) 


which can be straightforwardly evaluated numerically. 

In figure]^ we plot the which the Bayes factor Bqi takes on values of 

These results depend on the choice of the prior width /3; the Bayes 


1 , 


and 


0 - 4.5 


factor would be maximised in favour of AIi, if the prior on X{ was a Dirac delta 
distribution centred on the actually observed maximum of the Xi. In this extreme 
(and, admittedly, not very realistic) case, a Bqi of e~^ and would correspond 
to the likelihood ratio at the 2-tail 2tT- and Scr-limits of the normal distribution. For 


our choice of a symmetric top-hat prior, the integral in equation (182) is evaluated 
also over lower-likelihood regions of parameter space, leading to a suppression of 
the Bayes factor compared to the optimal case. This averaging procedure is absent 
in the frequentist analysis, which relies solely on comparing the best-ht values of 
the likelihood between the two models. As a consequence, the required to 
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Fig. 26. Threshold values of Ax^ax S'® s function of the number of data points Np of the data 
set X) for the Bayesian analysis. The grey, pale red and pale golden bands mark the Ax^^x st 
which the Bayes factor lnf?oi reaches 4.5, 2, and 0, for a range of choices of the width of the prior 
probability distribution of Xf. Positive values of InBoi indicate a preference for Aii- The lower 
and upper edges of the bands correspond to choices of = 10 and /3 = 15, respectively. 


reach a given Bayes factor is always greater than the one required to reach the 
corresponding frequentist p-value. 


7. Features searches 


To a certain extent, the prospects of being able to detect possible features of the 
primordial power spectrum depend on how well the featureless reference model per¬ 
forms in relation to the data. Since the release of Wilkinson Microwave Anisotropy 
Probe (WMAP) data |477) . the quality of fit of a power-law ACDM model to CMB 
anisotropy data has not been very good (but not too terrible either), with p-values 
hovering around the 5-10% region This result certainly does not mean that 


power-law ACDM is incompatible with data, but it also leaves room for improve¬ 
ment and has inspired the hope of finding deviations from a smooth power-law 
spectrum. Consequently, many groups have embarked on the quest for features, 
employing a large array of different methods. This variety of methods can broadly 
be categorised into two different, but complementary, classes which we shall refer 
to as bottom-up and top-down. 


ixxxpQj. £ > 30 Planck temperature data alone the p-value is about 17%, and roughly 7% for 
a combination of temperature and B-polarisation Adding the £ < 30 data would likely lead to 
slightly lower numbers. 
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The philosophy behind the top-down approach is to start from a theoretically 
motivated model and fit its predictions to data. Top-down typically introduces at 
most a handful of additional free parameters and can be combined well with both 
Frequentist or Bayesian statistical methods. While it is a great choice for the devout 
theorist, who wants to test their favourite model of inflation, it slightly suffers from 
the fact that its outcomes are very specific to one model, and so may not be the 
first choice for the agnostic phenomenologist. 

This is where the bottom-up approach comes into play which aims to reconstruct 
“ideal” primordial power spectra from the data. Reconstruction methods often in¬ 
troduce a large number of extra parameters and, due to their great flexibility, a 
Bayesian evidence based analysis will generally favour the simplicity of a feature¬ 
less spectrum. Additionally, the reconstructed spectra do not typically correspond 
to any proper physical model and are thus by no means realistic. However, they can 
nonetheless be very helpful in establishing whether the data indicate any significant 
deviations from smoothness (using frequentist methods), and, if so, in identifying 
what kind of features the data would prefer. In this sense, the reconstruction can 
be seen as providing guidance and inspiration to help one find realistic models that 
reproduce the reconstructed spectra’s most salient features. 


7.1. Bottom-up: Reconstruction of the primordial power spectrum 

Even though reconstruction is in principle meant to be a model-independent tech¬ 
nique, one nonetheless needs to decide on a parameterisation of Vnik)- At first 
glance, one might be tempted to expand Vizik) over the observable region of 
wavenumbers in some functional basis, e.g., a Taylor series. However, keeping in 
mind both feasibility of technical analysis and of physical interpretation, it is desir¬ 
able that these parameters do not exhibit strong and complicated correlations with 
each other. This requirement makes for instance a Taylor expansion ansatz fairly 
unpractical beyond the first few terms (though other more suitable basis functions, 
such as wavelets [1731173], or a principal component analysis |48()j are better be¬ 
haved in this regard). 

The majority of reconstruction methods in the literature are based on some sort 
of binning approach, where V-jz^k) is parameterised in terms of a set of amplitudes 
{VTi{kiy\ at a given set of wavenumbers or knots, {ki}, where the non-negligible 
entries of the correlation matrix are typically clustered around the diagonal. In the 
most simplistic case, the {Vniki)} are interpreted as bandpowers and the full power 
spectrum Vnik) is a series of discontinuous steps [48 Ij . Somewhat less unphysical 
spectra can be obtained by interpolating between the {V-Tiiki)}, e.g., using linear 
interpolation [48211484] or splines [485II488] . While the positions of the knots are 
often fixed before the analysis, it is possible to also treat the {ki] as variables [4891 
inni and even use the Bayesian evidence to decide how many additional knots are 
actually preferred by the data [4911155] . 

As long as the total number of free parameters does not exceed a few dozen, the 
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problem is amenable to both maximum likelihood analysis or Bayesian methods. 
However, with a thus limited number of degrees of freedom, the classes of spectra 
that can be obtained from reconstruction is limited as well: features finer than the 
bins defined by the {ki} cannot be reconstructed. There is no fundamental limit 
to how densely V']z{k) can be sampled though; the number of samples in k can 
even exceed the number of data points (e.g., the Cgs) if one is willing to forgo 
the possibility of an analysis based on exploring the likelihood/posterior and treat 


the issue as a deconvolution problem (i.e., an inversion of Equation 128) instead. 
Deconvolution techniques have been applied by several groups to CMB temperature 
data |492H497] . CMB temperature+polarisation data [498H500) and combinations 
of CMB data with large scale structure data |501j . 

In order to get rid of spurious high-frequency spikes that are likely to occur for 
noisy data when the primordial spectrum is oversampled, these methods typically 
involve a smoothing procedure. Effectively, the maximal possible resolution of the 
reconstruction is related to the width of the window functions of the data sets under 
consideration. Note also that the behaviour at the boundaries of the reconstructed 
^-interval can easily be dominated by the first/last data point, potentially leading 
to drastic suppression/enhancement (depending on whether these data points lie 
above or below the featureless model’s best-fit) of the reconstructed spectrum |502) . 
If such a behaviour is undesired, it can be avoided by introducing a likelihood 
penalty, as for instance in Ref. [SOHj . 

A number of these reconstruction methods have been applied to Planck 
data [5041 [T5] and find results in agreement with earlier analyses of WMAP data, 
e.g., those of Ref. m- Several distinctive features have been identified in recon¬ 
structed spectra: 


A cutoff-like suppression of power on the largest scales fc < 5 X 10 Mpc 
This is driven by the fact that, as seen in Eigure [T^ 02 , 0 ^ and C 4 are 


all below the expectation value of the power-law model’s best-fit. While 
dramatic-looking, one should keep in mind that actually only these first 


few multipoles are sensitive to the scales near this cutoff (cf. Eigure 11), 
and receive hardly any contribution from even larger scales, thus favouring 
as little power as possible at very small k. Such a cutoff-like behaviour is 
therefore more of an extrapolation of the spectrum into areas where the 
data are not informative at all, and is also not uncommonly encountered in 
reconstructions from featureless simulated CMB data. 

A dip around fc « 2 x 10“^ Mpc~^ followed by a bump at fc « 4 x 
10“^ Mpc“^, corresponding to a similar pattern in the Sachs-Wolfe plateau 
of extending from 20 < £ < 40. Eor the first-year WMAP data, this 
was shown to be the most important local deviation from a smooth spec¬ 
trum |479| . 

Later iterations of WMAP data as well as Planck data have revealed several 
additional local features, similar in shape to the previous one, but narrower 
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in In k (and therefore only found using methods with a high enough resolu¬ 
tion), e.g., at wavenumbers k « 3.5 x 10“^ Mpc“^ and fc « 6 x 10“^ Mpc“^, 
reproducing structures in the Cf^ data at multipoles £ ^ 450 and £ ~ 800, 
respectively. 


While these are certainly interesting candidates for primordial features, none of 
them have been shown to be absolutely required by the data, neither in the fre- 
quentist nor in the Bayesian sense. In Ref. m, three methods are applied to the 
2015 Planck data: the penalised likelihood reconstruction finds p-values no smaller 
than 0.045, and the spline-reconstruction in the same paper p-values obtains p- 
values of at least 0.11 and the Bayesian evidence-based variable knot-method 


shows no preference for spectra more complicated than a power-law either. 

Since the transfer functions are not independent of the late-time cosmology, it 
is also interesting to ask to what extent constraints on non-inflationary parameters 
depend on the assumption of a primordial power-law spectrum. With present data, 
the constraints on the ACDM model’s late-time cosmological parameters are in 
fact surprisingly stable when one allows a very general shape of the power spectrum 
compared to the power-law case 1505111^ (though in extended models this statement 
may not hold [506) 1. 


7.2. Top-down: Looking for specific features in the power spectrum 

In this approach, the data are tested for the presence of specific classes of theo¬ 
retically motivated features. They can take the form of a parameterisation of the 
primordial power spectrum, or, even more fundamentally, a parameterisation of the 
inflaton potential (which then requires Vn{k) to be calculated explicitly by integra¬ 
tion of the mode equations) Focussing on a particular model, the number of 
additionally introduced parameters in this case is typicall very small. This means 
that the scope of the analysis is not nearly as broad as in the bottom-up case 
and different classes of models would have to be tested case by case, but on the 
other hand, the higher degree of predictivity means that the Bayesian evidence is 
not doomed from the start to favour a smooth power spectrum. Also, compared to 
binned reconstruction methods, there is no inherent limit on the resolution of fea¬ 
tures in this approach, so very sharp or high-frequency features become accessible 
(but the chances of detection are of course still subject to the inherent resolution 
of the data considered). 

Just like for the reconstruction methods, the vast majority of parameterised 
features searches in the literature involve the CMB temperature angular power 


ixxxi Quoted are the smallest p-values from among different data combinations and analysis pa¬ 
rameters. 

ixxxiijt is also possible to combine a parameterisation of the spectrum and the potential; such 
a hybrid scheme has been employed to reduce dependence on the background inflaton potential 
when looking for features in V{<j>). 
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spectrum. The models analysed can roughly be divided into three categories, based 
on their phenomenology: (i) cutoff-like models with a power-law behaviour at small 
and intermediate scales, but a suppression of power on large scales, (ii) models 
with local features joining up two power-law segments, and (iii) models with global 
features, i.e., modifications of the power-law on all scales. 


7.2.1. Models with a large scale power suppression 


The phenomenological attraction of these models obviously lies in the observed lack 
of power at the largest scales, but they are also interesting from a theoretical point 
of view, since the largest scales correspond to the earliest observable moments of 
inflation, potentially carrying information about the onset of a slow-roll phase of 


inflation, as discussed in Section 3.4 Besides using ad-hoc parameterisations (e.g., 
piecewise power-laws, exponential cutoff, step-function cutoff), a range of physically 
motivated scenarios leading to a suppression of power have been explored: examples 
include a non-trivial topology of the universe |508j , initial kinetic domination |224) , 
initial radiation domination [5()9j or a sharp kink in the inflaton potential |1()3] have 
been explored in the literature. 

In the wake of the BICEP2 team’s claimed detection of a primordial tensor 
contribution these types of models experienced a brief surge in popularity 

due to their ability to ameliorate the apparent tension of a primordial power-law 
spectrum with the large-scale CMB temperature data, which is exacerbated in the 
presence of tensors [511] . There was in fact even some indication of a preference over 
a power-law spectrum [512H516j . However, this conclusion had to be revised after 
the re-evaluation of the BICEP2 data using additional information from Planck and 
the Keck array [517] which showed that the H-mode signal originally attributed to 
primordial tensor perturbations can be explained in terms of polarised galactic dust 
emission. The initial kinetic domination cutoff model has been tested against the 
most recent Planck temperature and polarisation data, but both frequentist and 
Bayesian analyses favour a featureless spectrum m- 


7.2.2. Models with a local feature 

Rather than massively deviating from an overall power-law shape of the spectrum, 
the general idea of these models is to improve the fit to the data by matching 
patterns confined to a small range of wavenumbers or multipoles. Physically, such 
an effect can be obtained by the occurrence of transient phenomena between two 
stages of standard slow-roll inflation and realised by many of the scenarios presented 
in Section |3l 

Consequently there is a rich literature of models that have been confronted with 


data. Resonant particle production (Section 3.2), for instance, was looked at in 

Ref. HIH]. 

The mechanism of Section[3.3|was discussed for a phase transition in a multiple 
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inflation model where the effective inflaton mass undergoes a sudden change |1561 
(see also |519LI520j ~). with particular emphasis on exploring alternatives to late¬ 
time ACDM-cosmology [14011521j . Another, more empirical case is a step in the 
inflaton potential parameterlsed by a tanh-function |522) . which has been found to 
match particularly well the 20 < £ < 40 feature of CMB temperature data 
[52511557115251 fmi [525) . with an effective « 10. However, though in the eyes 
of Planck data this model compares favourably with other features models analysed 
in Ref. [15] , it does not outperform the power-law spectrum. 

The Impact of heavy helds (Section 3.11 has been investigated in the context of a 
model with a temporary reduction in sound speed [1281137811527] . Here, the largest 
improvement in the effective was found to come from fitting the £ ^ 800 feature 
in the Planck temperature angular power spectrum, but the total improvement is 
relatively modest (^ 10) for a model with three additional parameters. 


7.2.3. Models with global features 

If the mechanism causing the emergence of features is not transient, but recurring or 
constantly at work, one can expect the resulting power spectrum to deviate from a 
power-law at all scales. A parameterisation of special interest here is the log-spaced 
sine-modulation of the primordial power spectrum. 


'Pnik) = 'P^{k) [1 -I- Q;cos(a;lnA:/fc* -|- (/?)] 


(183) 


where 'P^{k) is a smooth power-law. The modulation amplitude a and frequency lu 
can in principle be functions of k, but are often kept constant. 

From a technical perspective, the analysis of this model is somewhat challeng¬ 
ing: it tends to display a very complex likelihood landscape with many isolated local 
maxima, especially in the w-direction, making it hard to properly sample with con¬ 
ventional Markov-chain Monte-Carlo methods. And, while the numerical analysis 
of features models generally needs the the numerical precision settings of the Boltz¬ 
mann code to be increased (hence slowing down computation of observables), this 
is particularly true for the high-frequency (w > 100) part of this model’s parameter 
space, where the primordial power spectrum starts to require a denser sampling 


than the window functions when integrating Eq. (128). 


Eq. (|183| ) can be used to describe the signatures of trans-Planckian physics (see 
Section [3.5[ Figure[l0| and was first confronted with CMB data in this regard [528F 
1530] . Besides that, it is also a good approximation to the power spectrum predicted 
in the axion monodromy model discussed in Section |3.3[ where the parameters a 
and w can be related to the inflation potential. Analyses of different incarnations 
of the WMAP data have yielded effective of up to 20 for the monodromy 
model [1931153111169L1532] . For Planck data, however, the Ay^ is slightly lower [5331 
|5M1 and no compelling statistical evidence for a presence of oscillations has been 
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With the advent of Planck data, it has become possible to meaningfully go be¬ 
yond the CMB power spectrum and extend the search for features to the bispectrum, 
as well as performing combined analyses |379L I380j . The Planck bispectrum data 
have been compared to a number of bispectrum features templates, including both 
logarithmically and linearly spaced oscillations, albeit with a reduced frequency 
range compared to the power spectrum [5371115] , and it was shown that no single 
of the analysed features was preferred over a Gaussian model after correcting for 
the look-elsewhere effect. 


found [S351EM1IS5] 


Ixxxiii 


8. Concluding remarks and outlook 


As far as the prospects for cosmological observations are concerned, the present 
era represents a watershed. Not only might we be one mission approval away from 
an ultimate CMB measurement (e.g. [333] 1 targeting both polarization and spec¬ 
tral distortions, the Hubble volume in which we find ourselves constitutes a finite 
region within which it is conceivable that a large fraction, if not all large scale 
gravitationally bound structures will be mapped one day- possibly even within 
the lifetime of someone reading these words at the time of publication. Comple¬ 
mentary 3-d (i.e. uncompressed) information about modes up to comoving scales 
commensurate with those seen in the CMB will be extracted from LSS surveys (up 
to k = 10“^Mpc“^) and over hitherto unobserved scales through 21 cm observa¬ 
tions (up to fc = lO^Mpc”^ ) and observations of CMB spectral distortions (up 
to k = 10"^ Mpc“^'We will thus potentially be privy to a vast amount of 
untapped information about primordial physics, provided it can be appropriately 
deconvolved from these observations, given we know what qnestions to ask of it. 

As this review has attempted to highlight, features, if realized in nature, would 
offer a unique, discriminating window onto the microphysics that underlies the 
cosmological standard model. Although couched in terms of correlation functions, 
simple linear response theory allows us to extract new characteristic scales from 
features in certain contexts, enabling us to infer appropriately limited information 
about the parent theory (from the EFT point of view) in which the mechanism 
that generated structure is embedded. After surveying the various theoretically 
well grounded mechanisms through which features can be generated in correlation 
functions of the comoving curvature perturbation in the context of (adiabatic) in¬ 
flationary cosmology, we reviewed the various ways these could be traced through 
different cosmological observables. These include the anisotropies and spectral dis¬ 
tortions of the CMB as well as (at later times) the matter power spectrum and 
bispectrum inferred through LSS and conceivably, future 21cm surveys. For the lat- 


ixxxiiiXhis also holds true for linearly-spaced modulations of ’P'jzlk), which were examined in 
Refs. 1536113^ . 

ixxxixjn context of inflationary cosmology, this corresponds to being granted a view of another 
10 e-folds of inflation at work beyond the 6-7 that we see in the CMB anisotropies alone. 
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ter observations, much remains to be understood in terms of extracting quantities 
from observations that can be compared to theoretical predictions (in addition to 
the theoretical predictions themselves) at the scales and redshifts of interest, further 
investigations into which this review hopes to spur. 
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“So wie es ist, bleibt es nichV’ 
- Berthold Brecht 


Appendix A. A review of the ‘in-in’ formalism 

In connecting fundamental theory with late time cosmological observations, a cen¬ 
tral object of interest is the finite time correlation function of a particular operator 
(or string of operators) of some primordial field- {0{t)). The latter signifies a finite 
time correlator in a given quantum state (typically taken to be the Bunch-Davies 
vacuum state) at some initial or asymptotic time, unitarily evolved forward. This 
is to be contrasted with the matrix element of some time evolved initial state with 
some final state in the asymptotic future i.e. an S-matrix element, with which the 
reader might be more familiar. In the interaction picture, operator products of fields 
evolve by their free field equations of motion (and so admit an expansion in terms of 
the usual creation and annihilation operators), and states that are evolved forward 
in time by the unitary evolution operator: 

C(t,to) = Texp J7/(T')dry (A.l) 

where T denotes time ordering, and Hj denotes the totality of terms in the in¬ 
teraction Hamiltonian. Hence the operator expectation value {0{t)) is actually 



June 22, 2015 


102 


shorthand for 


(0(r)) = (0i„|f 


exp [ i / iJ/(r')dr' 
'to 


0{t)T expy-ij iJ/(r')dT'j |0in) 

(A.2) 

where T denotes anti time ordering- the result of taking the adjoint of a time ordered 
operator on the left hand side. One can view this as having time evolved from the 
initial time tq to time t, inserting the operator 0{t) at time r and then evolving 


back to time tq. Therefore one can equivalently consider (A.21 as the product of 
0{t) with the unitary operator: 


(0(r)) = (0i„|Tc 


exp ^ iJ/(r')dT' 


O(r)|0in), 


(A.3) 


where the contour integral now goes from rg —1 r and then back from r —i rg, as 


illustrated below- The expression (A.3) is merely a fancy way of rewriting (A.21, 


♦> 




+« 


Fig. 27. Contour used in evaluating ^ig. 28. Contour for evaluating the S- 

matrix 


and each represents an equivalent way of doing the same computation. What is 
important to realize is that because of the anti-time ordered evolution operator 
on the left of 0(t) in the expectation value in (A.21, any diagrammatic expansion 
of the above is rather more involvec Furthermore, were one to evaluate the 
above diagrammatically, great care would be needed, as not only are the associated 
Feynman rules more involved, one also has to account for the many cancellations 
among the relevant diagrams that are not immediately obvious from the above 
form. A more convenient representation of (A.2) that avoids this potential issue is 


ixxxvpQj. comparison, we recall that a typical S-matrix element is given 
by {out|T [exp ///(r'jdr')] | in), where the comparable states in the expectation value 

| |A.2[ | would be the out and in the in vacua respectively (note that these do not have to be the 
same in an interacting theory). The contour one makes in evaluating the S-matrix is indicated in 
figure 
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equivalently furnished by 

OO 


n=0 


' dT„ / 

To J To 


dT„,_i 


[Hi{t 2), ...[Hi{Tn),0{T)]...]]) 

(A.4) 


One can prove that (A.4) is equivalent to (A.2) inductively by showing that if up 


to order N, the time derivatives of (A.4) and (A.2) up to order are equal, 


then they will also be so at + 1*^ order [44]. One is free to work with either a 


diagrammatic expansion of (A.2) or the operator summation (A.4) as convenience 


dictates. For the purposes of this review, the operator evaluation (A.4) suffices. For 


a review of the diagrammatic approach- which comes in handy when evaluating 
higher order or loop corrections to certain observables- see |538j . 


Appendix B. Effective actions and particle production 

Consider a heavy field ij} coupled to a light field (p that represents the inflaton: 

s = J + J + ... (b.i) 

where for simplicity, we do not consider derivative couplings at quadratic order and 
ignore higher order terms in the heavy fielc^^^^^™ Integrating out the heavy field 
results in the effective action 




Dip e 




l[det(-n + M2[(^])]-i/2, 

(B.2) 


with given by the first term in (B.I). Thus 


W = J ^ 4 > 04 > - VinfiP) + ^Trln(-n + [(/)]). (B.3) 

As we demonstrate via the heat-kernel method in the next subsection, if is in¬ 

dependent of (j), the functional determinant is straightforwardly evaluated resulting 
in an effective action with the effective potential 


yesiP) — Vinf + y^ + 




In 


M^/iy 


(B.4) 


647r^ 

where Vet are the usual infinite renormalizations that we absorb into the bare cou¬ 
plings of the action and A is the mass scale introduced by regulating the loop inte¬ 
grals. This is the Coleman-Weinberg (CW) |539j corrected effective potentiab’'^’'''” 


More generally, we can rewrite (B.3) recalling (B.2) as 


W = f ^/-g - ViniW 


— i In Z, 




(B.5) 


ixxxvirpj^g latter can formally be accounted for by introducing sources and taking appropriate 
functional derivatives of the resulting generating functional. 

ixxxviiy^^jQ.^jj^g f-Qj. (derivative couplings between (p and xp and for M^(0) to depend arbitrarily on 
(p will in general result in corrections to (B.3i that constitute the usual derivative expansion. 
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with defined as 


Z^ = J Vijj e-5/’/'(-n+M=[0])v<^ 


(B.6) 


The resulting quantum corrected equations of motion are obtained by varying the 
effective action functional W with respect to </>, which gives 




where it should be clear that 




J Vip ijj 


2 


f Vip e“5 


(B.7) 


(B.8) 


and where it should be emphasized that the right hand side of (B.7) is a corre¬ 


lation function of two coincident fields, evaluated at a fixed spacetime point and 
thus suitably regularized. The subscript on the expectation value denotes that it 
will in general be a functional of (p and its derivatives. If in the asymptotic past 
M‘^[(p] —1 M^, where M characterizes some constant heavy mass scale, and if at any 
finite time M'^\(p] only ever differs from perturbatively, then the above can be 
straightforwardly evaluated via the Schwinger-Keldysh, or in-in formalism |54()II54J] 
(see above). If the initial state of the system was in the adiabatic vacuum of the ip 
field, then the net effect of time evolution in the interaction picture (effected by the 
Dyson operator) will be to evolve the vacuum into a state with finite occupation 
number mode by mode, described by mode functions related to the initial vacuum 
via the Bogoliubov coefficients au and pik- 


'^k — T P^k'^k 


(B.9) 


Given that such evolution will not excite modes of arbitrarily high energy, the Dyson 


operator corresponding to (B.9) evaluated at that moment is given (up to a phase) 


by the equivalent unitary operatoi 


.Ixxxviii 


[/(©) = (B.IO) 

where the and ak are the creation and annihilation operators associated with the 
vacuum of ip in the infinite past, and where 0^ := relates to the Bogoliubov 


coefficients of the transformation (B.9) as ak = cosh0fe, pik = e sinh0/j. From 


this, we straightforwardly evaluate recalling that ip evolves as it would in the 

Heisenberg picture as 

1 f d^k 

(27r)3 J 2ujk L" 

(B.Il) 

with iJj. = k'^+ . The first term in the square brackets above results in the usual 

divergent contribution of evaluating a two point correlation function at coincident 




1 + 2\akPk\cos{25k) + 


ixxxviiip^gq^^j.jj^g Hilbert spaces spanned by the mode functions on either side of i B.91 be 

unitarily equivalent implies that the must tend to zero sufficiently fast for large k. 
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points. This term contributes the usual renormalizations of the couplings of our 
theory and the effective potential. That is to say, bringing this contribution over to 


the left hand side of (B.7), when M'^{(j)) varies slowly enough we find the usual CW 

correctiorli^^^i^ 


d^k _ 

4(27r)3 J a/P~+M 2 327r2 


In MVA^ = (B-12) 


which evidently captures the variation of the vacuum energy density of the ■i/' field 


along the inflaton trajectory. The second term in (B.lll corresponds to a phase 
associated with each excited wave number. The so called ‘random phase’ states 
[551 [57] (such as thermal states or eigenstates of the number operator) contribute 
vanishingly. More generally, this contribution will be negligible compared to the last 
term. Thus we are left with 

d^k 


= J 


nk 


= l_ 

Wfc (27r)3 ~ J LOk (27r)3 


(B.13) 


where is the number density of particles with comoving momenta indexed by 
k. Evidently then, our task in evaluating additional contributions to the one loop 
effective action on top of the usual CW contributions boils down to accounting 
for all relevant contributions to particle production of the heavy quanta. These 


contributions, which sum up to (B.13), will in general not only depend on (j), but 


also its velocity, acceleration, etc. In general these terms will resum to the usual 
derivative expansion we are accustomed to. We now discuss several ways by which 
one might calculate these contributions, beginning with a brief introduction to the 
method of the heat kernel. 


Appendix B.l. The effective action via the heat kernel 

The (Euclidean) effective action for a light scalar (j), coupled to a heavy scalar ijj 
can be obtained by integrating out the field '(/'■ In the event that ip appears in the 
action quadratically, the contribution to the effective action is given by 


c-w[<p] _ 


= [det(-n + m2((/)))]-i/2. (b.14) 


Thus 


1 11 poo 1 

W = -In det[—□ +(()>)] = -Tr ln[—□ +(()))] = x / —Tr ((^)]^ 

2 2 2 Jq s 

(B.15) 

where the latter equality arises from the relation: 

ds 


lim 

e 2->.0 


e-"’" = lim -Ei[-a:e^] R:iln[x], 




(B.16) 


ixxxix^g drop all unphysical power law divergent terms in making this comparison. 
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where irrelevant constant terms have been discarded in the above. At this point, 
there are several formal paths one could embark upon to actually compute the effec¬ 
tive action, each with its own conceptual appeal. In this subsection, we summarize 
some standard results obtained via the rather versatile heat kernel methods (see 
for an excellent review, and the generalization to interacting fields). 

To begin with, we first note that the matrix operator that we are tracing over 


in (B.15) can be written in position space basis as: 

G{x,x';s) := ^'^^^\x'). 

This function is clearly a solution to the p.d.e.: 

[9s — -I- m'^{cj))]G{x, x'\s) = S(s)S^(x, x'). 


(B.17) 


(B.18) 


which allow us to identify it as the heat kernel for a 5-d spacetime with ‘time’ 
identified with the s direction, with unit diffusion co-efficient, and the Euclidean 
dimensions identified as the spatial dimensions. In the case that is a constant 
(or if d(j,rn?{(f)) is smaller than any other mass scale in the problem), we can solve 
the above directly in Fourier space and transforming back, to obtain: 


G{x, x'] s) = 0{s)- 






(B.I9) 


Therefore, if the characteristic time scales associated with the dynamics of (p are 
much longer than that of tp (i.e. it is a much lighter degree of freedom), we can 
approximate above as a constant and by using (B.19), compute the effective 

action, regulating the divergent lower limit of the integral over s via the regulator 
= 1/A^ (where A is some UV mass scale): 



^ J d^x (a:|e-"[-°+™'('^)l|a:) 



ds 




(B.20) 


The integral over s evaluates (to leading divergences) as: 



A^ — TO"‘((/))ln^m^(0)/A^^ 


(B.21) 


and so the one loop correction to the potential for the p field Vinf{(p) results in the 
effective potential: 


Ves{4>) 


Idnf + Vet + 


Tn‘^{(p) 

647r^ 


In 


rn^ {(p) / A? 


(B.22) 


where I4t contains the standard counter terms one has to add to the inflaton po¬ 
tential Vinf in this renormalization scheme. This is the standard Coleman-Weinberg 
effective potential, and the consequences of this correction for the predictions of in¬ 
flation have been explored in detail in [65j . The derivative corrections to the above 
that would result if is no longer assumed to be a slowly varying function of 
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we result in the standard effective field theory (derivative) expansion that is dis¬ 
cussed in the context of inflation in [66j . In certain special cases as exploited in this 
paper, it is possible to compute the leading order contributions to the functional 
determinant for non-constant m?{(j)) exactly. 


Appendix B.2. Particle production via the heat kernel 


As an illustration of the heat kernel method, we discuss a canonical example that has 
as a limit, an example that has previously been studied by other methods |1J7] |144] . 
We consider a light held (j) coupled to a massive scalar held tp: 


1 1 

C = - —- ^{4'- 


(B.23) 


Our task is to compute the effective action (B.14) around the background trajectory 


of the light held. If (p corresponds to a slowly rolling inhaton, the potential (B.23) 


results in the following Euclidean mass term (tachyonic, for small enough M^) for 
the helcf^ 

— A^0q(t — r*)^, (B.24) 

where (po denotes the spatially homogeneous solution, and cpo is the held velocity 
so that the background solution when -ip = 0 is given by <po{t) = 4ot = —i(poT- 


According to (B.14) and (B.15), we are then required to evaluate 


W[</>] = J 




(B.25) 


where we have set r* = 0 for simplicity. In spite of the non-trivial spacetime prohle 
of the inhaton held, the operator trace we have to evaluate turns out to have a 
closed analytic expression in the limit we ean consistently treat (po as an external 
baekground An analogous operator trace arises when comptuing the ehective 

action for Dirac fermions in a constant background electric held |545j |54fij . 

In proceeding, we hrst take note of an alternative prescription to the Schwinger- 


Keldysh, or in-in approach towards computing (B.7), where the two point correlation 


function of interest is evaluated by evolving initial states from the asymptotic past 
to some hnite time. If we were instead only willing to compute the time evolution 
operator along the contour from the asymptotic past to the asymptotic future (as we 
do when we compute S-matrix elements), we could still calculate the contribution 
to the effective action due to particle production from the imaginary part of the 
effective action: 


(OoutlOin) 


(B.26) 


^ gi[Re(iy)+iIm(W)], 


^^Since particle production, if it occurs at all, will happen in a very brief window around 0 = 0* 
we adopt the common |137| |144| approach of approximating de Sitter space with Minkowski space 
in what follows. 

when we can neglect backreaction on the background trajectory that may arise, for example 
from particle production. 
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and so therefore 


|(0out|0i„)P = e-2i“W. 


(B.27) 


In words, if the adiabatic conditions are met throughout, the in vacuum will evolve 
to the out vacuum. If these conditions are not met even at localized events, the in 
vacuum will have evolved into a state which is populated as far as the out vacuum 
is concerned, and the two states will have an overlap that differs form unity. The 
difference of this overlap from unity tells us exactly how many excited quanta have 
been created per spacetime unit volume, and is evidently given by the imaginary 


component of the effective action (|B.25| thus computed. To evaluate this in the 

(B.28) 


context we are interested in, consider the relevant functional trace: 


we recognize in the above that the first two terms group into the Hamiltonian of a 
harmonic oscillator in an inverted potential [546] : 


tr e 




with the identification 


H = - v^A^0nT" 


(B.29) 

(B.30) 


2m ' 2' V - ■ 

so that the mass of the oscillator is to be read off as m = \/2^/s and the spring 
constant as fc = —2y/s\’^4>^, resulting in the imaginary frequency w = 2i\(j3Qy/s. We 
can immediately evaluate this sub-trace to yield: 


= tr e 


__ 


= —i tr e 


where the sum is straightforwardly evaluated so that 


-V5a;[ata-|-i] _ ^ g- 


n=0 


2iX4)Qs{n-\-^) 

(B.3I) 
(B.32) 

2 _ Q—2iX<pos 

where the factor of —i appears from the Jacobian in transforming to the ladder 
operator basis. The remaining factor in the operator trace is easily evaluated: 


tr e 




^—iX(j)QS 


tr = 


d^p 


(27r)3/2 


sp _ 


I 


(2s)3/2’ 


(B.33) 


so that we are left evaluating: 


ds 

( 

' _ 'I '\n+l 


/o S (2s)3/2 X _ g2iX^os 

2 ^ 
n—l 

2n ' 

\2n7r/ 


1 

2 

^ “ n— I 

(B.34) 

where the latter follows from the standard principal value evaluation of the integral 
which has poles along the positive s axis [545]: 

lim - = P —h TriS(x). 

£->■-1-0 X — ie X 


(B.35) 
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It is interesting to leave the functional trace in (B.33I undone to obtain the equiv¬ 
alent expression: 


Im W = Im - 
2 , 

= 1 ^ 


f d^P 

' (27r)3/2 

d^p ^ 


ds 


-s{p'^ + M^ 


(27r)3/2 
^ ' n—1 


-n7r(p^+M^)/A<^o 


^iX^QS 

^ _ ^2iX<pQS 

(_i)"+i 
^ ’ 


(B.36) 


the truncation to n = 1 of which, is to be compared to (A.8) of HiTTp^ The form 
above permits an easy interpretation in terms of the spectrum of the ^ quanta 
created. We start by noting that the pair prodnction probability per unit spacetime 
is given by 

d^p 


F/V.J 


-Mp)\\ 


(B.37) 


(27r)3/2 ' 

where j3{p) is the Bogobliubov coefficient of the created mode p. However we note 


from (B.27) that the quantity (B.36) has an identical interpretation: 

|(0out|0in)|" = = 1 - 2Im(W) ^ P = 2Im(W) 

Therefore we can read of the spectrum of created particles as: 


|/3(p)p = ^ )/>'<t>o . 


n—1 

r2 


2n 


(B.38) 


(B.39) 


We immediately see that when A^o, particle production is heavily suppressed. 

However when —>• 0, we see from (B.23) that ip has a effective mass everywhere 

except when (/> = 0*, at which point ip is effectively massless resulting in copious 
particle production at that instant, quantified as: 


OO 

|/3(p)|2 = 

n—1 


(- 1 ) 


n+1 


2n 


(B.40) 


the n = 1 truncation of which was studied in [14411147] as a means to generate 
features in the power spectrum as well as to generate non-trivial shapes for the 
CMB bispectrum. In [137] particle production at such ‘enhanced symmetry points’ 
was suggested as a means to trap rolling moduli that approach such pointsFor 


completeness, one could evaluate the sum in (B.40) and perform the subsequent 
momentum integral for small velocities 




(B.41) 


We note that our derivation is strictly only valid in the limit when particle produc¬ 
tion does not appreciably backreact onto the trajectory of (pQ. Were it to do so, we 


to (A. 10) in the same reference, as originally derived in 
|91| for how massless string states consistently stabilize moduli at such points. 
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would have invoke an iterative procedure in accurately estimating the imaginary 
part of the effective action. However at weak coupling, the first order treatment 
compares very accurately to lattice simulations Ell- 


Appendix B.3. An analytic example 


We consider the potential (831 in which a heavy direction suddenly becomes frac¬ 
tionally lighter for a brief window 

Vi(j),'ip) = Vintifl)) + 1 - A^sech^[((/)- 


(B.42) 


In the above, /i is some mass scale that characterizes the skew field gradient of 
H((?!),'0), T^nf is a potential which admits slow roll, and A is a dimensionless param¬ 
eter. 

If is greater than any other scale in I^„f, then classically (i.e. at tree level), 
after the infiaton (f) has settled on it’s attractor, it will roll along the trough of 
these potentials uninterrupted as -i/; will be in its ground state. If there are no bends 
in the trajectory, tp remains in its ground state throughout. However at one loop 
we know that this will no longer be the case- the zero point energies of the heavy 
modes change as the effective mass of ip changes. This change manifests adiabatically 
through the variations of the parameters of the CW potential and non-adiabatically 
through the particle production contribution to the effective potential (respectively. 


the first and the third terms of (B.Il)). 


We are interested in computing the contribution (B.131 for (B.42). From this, we 


can readily compute corrections to CMB observables that result from contributions 
to the effective potential due to particle production. We first note that the equation 


of motion for the ip field that results from (B.42) is given by: 


ip + M^ 


1-f 


A2 


M2 cosh^)^^] 


ip = 0, 


(BAS) 


where since we are only interested in the behaviour of the field in a brief window 
around (p^ in computing the spectrum of produced particles, we can safely approx¬ 
imate the background as Minkowski in what follows. Furthermore, assuming that 
(p> is slow-rolling, to first order we consider (p — (p^ simply to be (po(t — t*) in the 
above. Certainly, higher order corrections can be entertained, but as we shall see, 
if the backreaction of the produced particles on the background infiaton trajectory 
is negligible, this is a consistent approximation. Therefore, the equation of motion 
for the Ip quanta 


iP + M^ 


M2 


cosh^[^-^ 


-0 = 0 , 


(B.44) 


where A := p./(po. We note that far enough away from 0*, the ip quanta propagate 
as free particles. Particle production would be indicated by a non-trivial overlap 
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between the exact mode functions of (B.43) with those of the corresponding neg¬ 


ative frequency free particles at late times. Computing the Bogoliubov coefficients 
j3k becomes computationally identical to determining the ratio of the reflection and 


transmission coefficients (R/T) of the wave function defined by (B.431 when consid¬ 
ered as a Schrbdinger equation in the presence of the barrier defined by the inverted 


potential in the above. Exact mode functions that satisfy (B.43) are in fact avail¬ 


able, and the precise problem of barrier transmission in the potential defined by 
(B.43) has been studied previously by Eckart |548j . With the appropriate boundary 


conditions (the normalization T = 1, and setting t* =0), we find the exact solutions 

Wfe 1 


lAi.lt') = - - ---I-- 1-i _ 

^[ 22 ’ 2 + 2’^ + 


with 


/3(i) := 


1 


(B.45) 


(B.46) 


^ 2^2 ^ 2^2 

and where Wfc = F{x) is the hypergeometric function 2 Fi{x), and: 

A2 \ 1/2 




(B.47) 


For large t given that E'(O) = 1, we see that we recover an outgoing plane wave with 
transmission co-efficient unity, as per our normalization: 

^ 00 ) ^ (B.48) 

For t —)■ — 00 , one can exploit the transformation formula (with ^ := — 

[MH| : 

1 




Wfc 


1 K 1 K 
2"2’"2 + 2’ 


+ 2’ 


(B.49) 


rl 


1 

[2 

2’ 

2 


-I- 02 


(o^) 


-iiifc/2y^/3(i) pX 

F - — — — i- 


K 




Ik . Wfe . Wfe 

— - -I- - — i -—, I — i 


.(l-C)V ^ L2 2 2 ' 2 

to obtain the past asymptotic form (with ^ —>• 0): 

^|;{t ^ 00) ~ 016*^^“''* -f 026-*“'^'=*, 

and with 

r[l - iCok/y/^]T[-iL0k/ 




(B.50) 


oi = 


02 = 


r[l/2 + ac/2 - iCbk/./^)]r[l /2 - k/ 2 - 
r[i - y^]r[iwfc/ 


(B.51) 


r[i/2-hK/2]r[i/2-K/2] 
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From here, it is straightforward to compute the Bogoliubov coefficients |/?fep = 
|i?/T| = |a 2 /aip as: 

2 1 + cos[7rK] 


cosh[27ra)fc/iy/?(!)] + cos[7r«;] ^ ^ 

We can immediately infer some basic properties of the above by taking some limits. 
Clearly, when A —>■ 0 in (B.42), n = y^l — 4A^//3(i) —>■ 1, and so /3fc —>■ 0 for all k, as 
expected. Next, when taking the A = 0 limit, we find: 


m = 


32(/c2 + M2) ^ 


(B.53) 


which demonstrates as expected the preferential production of long wavelength 
modes, which disappears once the window A drops significantly below the charac¬ 
teristic time scale M~^ of the ip field. We note that in the above k can be imaginary 
as well, where in this case the identical expression would result except with |ft:| in 
place of K |548j . A particularly transparent special case is when /c = 0, i.e. when 
A = 1/2AM: 


1 -I- cosh 


7ry'(l-|-fe2/M2) 

A 


(B.54) 


Where again —>• 0 as A —>■ 0, and modes with wavelengths shorter than M~^ are 
preferentially produced. Here and in its other guises above, we notice how particle 
production appears as a non-perturbative result (vanishing faster than any power 
of the the perturbative coupling, hence inaccessible to any perturbative expansion 
[S3S])- as had been remarked upon previously |549| |546j . We plot the number den¬ 
sity of produced i/' quanta at (/)* in fig[^ given through (B.13I. 
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